FIGURE 1 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGGAGGCACAGGTGGCCCCCACCACCCGGAGG 
AGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGA 
AGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCT 
TCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTTAGGGTGTGTGCT 
GTCCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCC 
TCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATTTATAGGACCGCCTAC 
CGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAG 
GACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAG 
GGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAG 
TCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGC 
CGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTG 
TGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAG 
GAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCT 
GGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCC 
TCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTC 
CTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGG 
CTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTC 
CAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTTTTCCTCCTC 
CCCTTCCCTCGGGAGGGTCCCCAGACCCTGGCATGGGATGGGCTGGGATTTTTTTTGTGAAT 
CCAGCCCTGGCTACCCCCACCCTGGTTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCA 
GCTGAGGGAAGGTACGAGTTCCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCC 
CGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAA 
CGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCT 
AGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGT 
TACAAAT 



FIGURE 2 



MTDSPPPGHPEEKATPPGGTGHEGLSGGAADVASGVGSGRHRARLPARPLGCVLSRAHGDPV 
SESFVQRVYQPPLTTCDGHRACSTYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLEGAC 
GAAICQPPCRNGGSCVQPGRCRCPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCW 
EGHSLSADGTLCVPKGGPPRVAPNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLAS 
QALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFLEEQLGSCSCKKDS 

Signal sequence: 

amino acids 1-19 

CAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 93-97, 270-274 

N-myristoylation sites. 

amino acids 19-25, 78-84, 97-103, 100-106, 103-109, 157-163, 
191-197, 265-271 

Amidation site. 

amino acids 26-3 0 

Aspartic acid and asparagine hydroxylation site. 

amino acids 152-164 

Cell attachment sequence. 

amino acids 130-133 

EGF-like domain cysteine pattern signature. 

amino acids 123-135 



FIGURE 3 



CGCTCGCCCCGTCGCCCCTCGCCTCCCCGCAGAGTCCCCTCGCGGCAGCAGATGTGTGTGGG 
GTCAGCCCACGGCGGGGACTATGGTGAAATTCCCGGCGCTCACGCACTACTGGCCCCTGATC 
CGGTTCTTGGTGCCCCTGGGCATCACCAACATAGCCATCGACTTCGGGGA6CAGGCCTTGAA 
CCGGGGCATTGCTGCTGTCAAGGAGGATGCAGTCGAGATGCTGGCCAGCTACGGGCTGGCGT 
ACTCCCTCATGAAGTTCTTCACGGGTCCCATGAGTGACTTCAAAAATGTGGGCCTGGTGTTT 
GTGAACAGCAAGAGAGACAGGACCAAAGCCGTCCTGTGTATGGTGGTGGCAGGGGCCATCGC 
TGCCGTCTTTCACACACTGATAGCTTATAGTGATTTAGGATACTACATTATCAATAAACTGC 
ACCATGTGGACGAGTCGGTGGGGAGCAAGACGAGAAGGGCCTTCCT6TACCTCGCCGCCTTT 
CCTTTCATGGACGCAATGGCATGGACCCATGCTGGCATTCTCTTAAAACACAAATACAGTTT 
CCTGGTGGGATGTGCCTCAATCTCAGATGTCATAGCTCAGGTTGTTTTTGTAGCCATTTTGC 
TTCACAGTCACCTGGAATGCCGGGAGCCCCTGCTCATCCCGATCCTCTCCTTGTACATGGGC 
GCACTTGTGCGCTGCACCACCCTGTGCCTGGGCTACTACAAGAACATTCACGACATCATCCC 
TGACAGAAGTGGCCCGGAGCTGGGGGGAGATGCAACAATAAGAAAGATGCTGAGCTTCTGGT 
GGCCTTTGGCTCTAATTCTGGCCACACAGAGAATCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCAGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGAGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTCCTTGCCCCCAGCTCTGTGCTGCG 
GATCATCGTCCTCATCGCCAGCCTCGTGGTCCTACCCTACCTGGGGGTGCACGGTGCGACCC 
TGGGCGTGGGCTCCCTCCTGGCGGGCTTTGTGGGAGAATCCACCATGGTCGCCATCGCTGCG 
TGCTATGTCTACCGGAAGCAGAAAAAGAAGATGGAGAATGAGTCGGCCACGGAGGGGGAAGA 
CTCTGCCATGACAGACATGCCTCCGACAGAGGAGGTGACAGACATCGTGGAAATGAGAGAGG 
AGAATGAATAAGGCACGGGACGCCATGGGCACTGCAGGGACGGTCAGTCAGGATGACACTTC 

GAAAGAGGCCTTGATTTAAAGGTTTCGTGTCAATTCTCTAGCATACTGGGTATGCTCACACT 
GACGGGGGGACCTAGTGAATGGTCTTTACTGTTGCTATGTAAAAACAAACGAAACAACTGAC 
TTCATACCCCTGCCTCACGAAAACCCAAAAGACACAGCTGCCTCACGGTTGACGTTGTGTCC 
TCCTCCCCTGGACAATCTCCTCTTGGAACCAAAGGACTGCAGCTGTGCCATCGCGCCTCGGT 
CACCCTGCACAGCAGGCCACAGACTCTCCTGTCCCCCTTCATCGCTCTTAAGAATCAACAGG 
TTAAAACTCGGCTTCCTTTGATTTGCTTCCCAGTCACATGGCCGTACAAAGAGATGGAGCCC 
CGGTGGCCTCTTAAATTTCCCTTCTGCCACGGAGTTCGAAACCATCTACTCCACACATGCAG 
GAGGCGGGTGGCACGCTGCAGCCCGGAGTCCCCGTTCACACTGAGGAACGGAGACCTGTGAC 
CACAGCAGGCTGACAGATGGACAGAATCTCCCGTAGAAAGGTTTGGTTTGAAATGCCCCGGG 
GGCAGCAAACTGACATGGTTGAATGATAGCATTTCACTCTGCGTTCTCCTAGATCTGAGCAA 
GCTGTCAGTTCTCACCCCCACCGTGTATATACATGAGCTAACTTTTTTAAATTGTCACAAAA 
GCGCATCTCCAGATTCCAGACCCTGCCGCATGACTTTTCCTGAAGGCTTGCTTTTCCCTCGC 
CTTTCCTGAAGGTCGCATTAGAGC6AGTCACATGGAGCATCCTAACTTTGCATTTTAGTTTT 
TACAGTGAACTGAAGCTTTAAGTCTCATCCAGCATTCTAATGCCAGGTTGCTGTAGGGTAAC 
TTTTGAAGTAGATATATTACCTGGTTCTGCTATCCTTAGTCATAACTCTGCGGTACAGGTAA 
TTGAGAATGTACTACGGTACTTCCCTCCCACACCATACGATAAAGCAAGACATTTTATAACG 
ATACCAGAGTCACTATGTGGTCCTCCCTGAAATAACGCATTCGAAATCCATGCAGTGCAGTA 
TATTTTTCTAAGTTTTGGAAAGCAGGTTTTTTCCTTTAAAAAAATTATAGACACGGTTCACT 
AAATTGATTTAGTCAGAATTCCTAGACTGAAAGAACCTAAACAAAAAAATATTTTAAAGATA 
TAAATATATGCTGTATATGTTATGTAATTTATTTTAGGCTATAATACATTTCCTATTTTCGC 
ATTTTCAATAAAATGTCTCTAATACAAAAAA 



FIGURE 4 



MVKS-PALTHyWPLIRFLVPLGITNIAIDFGEQArjmGIAAVKEDAVEMIiASYGLAYSLMKFF 
TGPMSDFKl^GLVFVNSKiyDRTKAVLCMVVAGAIAAVFHTLIAYSDIiGYYIINKLHHVDESV 
GSKTRRAFLYLAAFPFMDAMAWTHAGILLKHKYSFLVGCASISDVIAQWFVAILLHSHLEC 
REPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLALIL 
ATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYPAFDKNNPSN 
KLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCWPLR 
IFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLVVLPYLGVHGATLGVGSLL 
AGFVGESTMVAIAACYVYRKQKKKMENESATEGEDSAMTDMPPTEEVTDIVEMREENE 

Transmembrane domains: 

amino acids 86-106, 163-179, 191-205, 237-253, 327-343, 357-374, 
408-423, 431-445 
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CCTGACAGAAGTGCCCCGGAGCTGGGGGAGATNCAACATTAAGAAGATGCTGAGCTTCTGGT 
GCCNTTTGGCTCTAATTCTGGCCACACAGAGAANCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCGGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGNGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTC 



FIGURE 6 



TGACGGAATCCCGGGCTGGGTATCCTGGTTTNGACAAGATAAACCCCCAGCAANAAATTGGG 
GAGCAGGGCAAAACAGTNACGGGCAGCCCACATCAAGAAGTTCACCTTNGTTTGNATGGNTC 
TGTCAACTCACGCTNTGTTTCGTGATGTTTTGGACACCCAAAGTGTTTGAGAAAATTTTGAT 
AGACATNATCGGAGTGGANTTTGCCTTTGCAGAANTTTGNGNTGTTCCTTTGCGGATTTTCT 
CCTTTTTCCCAGTTCCAGTCACAGNGAGGGCGCATCTCACCGGGNGGNTGATGACANTGAAG 
AAAACCTTTGTCCTTGCCCCCAGCTNTTTGGTGCGGATCATTGTCCTNATNGCCAGCCTTGT 
GGTCCTACCCTACCTGGGGGTGCACGGTGCGACCCTGGGCGTGGGTTCCCTCCTGGCGGGCA 



FIGURE 7 



TATTCCCAGTTCCGGTCACGGGGAGGGCGCATNTCACCGGGTGGCTGANGACACTGAAGAAA 
ACCTTNGTCCTTGCCCCCAGNTTTGTGNTGCGGATNATCGTCCTCATCGCCAGCCTNGTGGT 
CCTACCCTACCTGGGGGTGCACGGTGAGAC 



FIGURE 8 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGA 
GCCTGCTCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAG 
CTGCTGCCCCGCCAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCC 
TGGGGGTGCTGGTGTCCATCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTG 
CCCTGGGTGTGTGAGGAGGGGGCCGGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGG 
CTCCCTGCTTGGCTACCGCGCTGTCTACCGCATGTGCTTCGCCACGGCGGCCTTCTTCTTCT 
TCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCAGCCGGGACCCCCGGGCTGCCATCCAG 
AATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTCACCGTGGGTGCCTTCTACAT 
CCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGT6GGCTCCTTCCTCTTCA 
TCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGTGGCTGGGC 
AAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTCTT 
CTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCT 
GCCACGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCT 
GCTGTCCTGCCCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCAT 
CACCCTCTACACCATGTTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCA 
ACCCCCATTTGCCAACCCAGCTGGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAG 
ACCCAGT6GTGGGATGCCCCGAGCATTGTGGGCCTCATCATCTTCCTCCTGTGCACCCTCTT 
CATCAGTCTGCGCTCCTCAGACCACCGGCAGGTGAACAGCCTGATGCAGACCGAGGAGTGCC 
CACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGTGGCAGCCTGTGAGGGCCGGGCC 
TTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCCACTTCTGCCTGGTGCT 
GGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAGACCCGGAAGA 
TGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCTCCTC 
TACCTQTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGAG GCAGCCT 
CACAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCT 
GCCCCCTCCCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTG 
CCCCTGAGCCGGGCCTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCA 
GAGCCCCATCCCCCCGCCACACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGT 
TGCCCATACTCAGCATCTCGGATGAAAGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGG 
CTGCTGGAGAGAGCGGGGAACTCCCACCACAGTGGGGCATCCGGCACTGAAGCCCTGGTGTT 
CCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTGGACTTCGTGCCTTACTGAGTCTCT 
AAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVE 
SQLYKLPWVCEEGAGI PTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLIiMIiCVS S SRD 
PRAAIQNGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSW 
NQRWLGKAEECDSRAWYAGLFFFTLLFYLLS lAAVALMFMYYTEPSGCHEGKVFI S LNLTFC 
VCVS lAAVLPKVQDAQPNSGLLQASVITLYTMFVTWSALSS I PEQKCNPHLPTQLGNETWA 
GPEGYETQWWDAPSIVGLIIFLLCTIiFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVA 
ACEGRAFDNEQDGVTYSYSFFHFCLVIASLHVmTLTNWYKPGETRKMISTWTAVWVKICAS 
WAGLLLYLWTLVAPLLLRNRDFS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 



rarrnrr 



FIGURE 10 



GAGCGAGGCCGGGGACTGAAGGTGTGGGTGTCGAGCCCTCTGGCAGAGGGTTAACCTGGGTC 
AAATGCACGGATTCTCACCTCGTACAGTTACGCTCTCCCGCGGCACGTCCGCGAGGACTTGA 
AGTCCTGAGCGCTCAAGTTTGTCCGTAGGTCGAGAGAAGGCC ATGG AGGTGCCGCCACCGGC 
ACCGCGGAGCTTTCTCTGTAGAGCATTGTGCCTATTTCCCCGAGTCTTTGCTGCCGAAGCTG 
TGACTGCCGATTCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCC 
TATTACCCGGAATCTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAG 
AATTTCAAAGGACCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGG 
TGTATGGGGGAATACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCA 
GAAATTTATCATAACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTT 
CATTCGTTATGGCTGGCGCTGGGGTTGGAGAACTGCAGTGTTTGTGACTATATTCAACACAG 
TGAACACTAGTCTGAATGTATACCGAAATAAAGATGCCTTAAGCCATTTTGTAATTGCAGGA 
GCTGTCACGGGAAGTCTTTTTAGGATAAACGTAGGCCTGCGTGGCCTGGTGGCTGGTGGCAT 
AATTGGAGCCTTGCTGGGCACTCCTGTAGGAGGCCTGCTGATGGCATTTCAGAAGTACGCTG 
GTGAGACTGTTCAGGAAAGAAAACAGAAGGATCGAAAGGCACTCCATGAGCTAAAACTGGAA 
GAGTGGAAAGGCAGACTACAAGTTACTGAGCACCTCCCTGAGAAAATTGAAAGTAGTTTACG 
GGAAGATGAACCTGAGAATGATGCTAAGAAAATTGAAGCACTGCTAAACCTTCCTAGAAACC 
CTTCAGTAATAGATAAACAAGACAAGGACTGAAAGTGCTCTGAACTTGAAACTCACTGGAGA 
GCTGAAGGGAGCTGCCATGTCCGATGAATGCCAACAGACAGGCCACTCTTTGGTCAGCCTGC 
TGACAAATTTAAGTGCTGGTACCTGTGGTGGCAGTGGCTTGCTCTTGTCTTTTTCTTTTCTT 
TTTAACTAAGAATGGGGCTGTTGTACTCTCACTTTACTTATCCTTAAATTTAAATACATACT 
TATGTTTGTATTAATCTATCAATATATGCATACATGGATATATCCACCCACCTAGATTTTAA 
GCAGTAAATAAAACATTTCGCAAAAGATTAAAGTTGAATTTTACAGTTT 



FIGURE 11 

></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss . DNA23318 
xsiibunit 1 of 1, 285 aa, 1 stop 
xMW: 32190, pi: 9.03, NX(S/T>: 2 

MEVPPPAPRSFLCRALCLFPRVFAAEAVTADSEVLEERQKRLPYVPEPYYPESGWDRLRELF 
GKDEQQRISKDLANICKTAATAGI IGWVYGGI PAFIHAKQQYI EQSQAE lYHNRFDAVQSAH 
RAATRGFIRYGWRWGWRTAVFVTIFNTWTSLNVYRNKDALSHFVIAGAVTGSLFRINVGLR 
GLVAGGIIGALLGTPVGGLLMAFQKYAGETVQERKQKDRKALHELKLEEWKGRLQVTEHIiPE 
KIESSLREDEPENDAKKIEALLNLPRNPSVIDKQDKD 

Important Features: 
Signal Peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 76-96 and 171-195 

N-glycosylatibn site: 

amino acids 153-156 
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CGGAAGTCCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATCTGGATGGGACCGCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGGA 
CCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGAA 
TACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATCAT 
AACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTTCATTCGTTCATG 
GCTGGCGCCGAACC 
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TCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGCACCGCGGAGCTT 
TTTTCTGTAGAGCATTGTGCCTATTTCCCCGAGTTTTTGCTGCCGAAGCTGTGACTGCCGAT 
TCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATTTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGG 
ACCTTGCTGATATNTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGA 
ATACCAGCTTTTATTCATGNTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATNA 
TAACC 



FIGURE 14 



GAGCCGCCGCCGCGCGCGCGCCGCGCACTGCAGCCCCAGGCCCCGGCCCCCCACCCACGTCT 
GCGTTGCTGCCCCGCCTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCT 
CCGCCGGAGTCGAATTTACGTGCAGCTGCCGGCAACCACAGGTTCCAA GATGG TTTGCGGGG 
GCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTTACACCTTGGTTAGTCTG 
CTGCTAATTGGAATTGCTGCGTGGGGCATTGGCTTCGGGCTGATTTCCAGTCTCCGAGTGGT 
CGGCGTGGTCATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAG 
CTGTAAAACATCATCAGGTGTTGCTATTTTTTTATATGATTATTCTGTTACTTGTATTTATT 
GTTCAGTTTTCTGTATCTTGCGCTTGTTTAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCT 
GGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAATCTAAACTGCT 
GTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGACCAC 
TCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGG 
TGGCATTGGCCTGTTCTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACCTACAGATACA 
GGAACCAGAAAGACCCCCGCGCGAATCCTAGTGCATTCCTTTGATGAGAAAACAAGGAAGAT 
TTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTCTGTTAAGCTCCATTTGCCAGT 
TTAAGGAAGGAAACACTATCTGGAAAAGTACCTTATTGATAGTGGAATTATATATTTTTACT 
CTATGTTTCTCTACATGTTTTTTTCTTTCCGTTGCTGAAAAATATTTGAAACTTGTGGTCTC 
TGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTT 
TCTTAGCATTTTTACCTGCAGAAAAACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAA 
TCTGAACGTACATCTCACTGGTATAATTATATGTAGCACTGTGCTGTGTAGATAGTTCCTAC 
TGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTTAAGGGAAA 
TCCAAATTCCCAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTA 
TAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTAT 
GTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTT 
CATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCATCAGAATGGAACGAGTTT 
TGAGTAATCAGGAAGTATATCTATATGATCTTGATATTGTTTTATAATAATTTGAAGTCTAA 
AAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGATATTTG 
ATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTT 
GTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTG 
GCTCTGTATATTCTGTTAAAAAATTAAAGGACAGAAACCTTTCTTTGTGTATGCATGTTTGA 
ATTAAAAGAAAGTAATGGAAG 



FIGURE 15 

></usr/seqdb2/sst/DNA/Dnaseqs ,min/ss . DNA39979 
xsubunit 1 of 1, 204 aa, 1 stop 
xMW: 22147, pi: 8.37, NX(S/T): 3 

MVCGGFACSKNCLCALNLLYTLVSLLLIGIAAWGIGFGLISSLRWGWIAVGIFLFLIALV 
GLIGAVKHHQVLLFFYMIILLLVFIVQFSVSCACIALNQEQQGQLLEVGWNNTASAKNDIQR 
NLNCCGFRSVNPNDTCLASCVKSDHSCSPCAPIIGEYAGEVLRFVGGIGLFFSFTEILGWL 
TYRYRNQKDPRANPSAFL 

Signal Peptide: 

amino acids 1-34 

Transmembrane domains: 

amino acids 47-63, 72-95 and 162-182 
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TGATTGGAGCTGTAAAAAANTCTTCAGGTGTTGTNATTTTTTTATATGATTATTCTGTAANT 
TGTATTTATTGTTCAGTTTTNTGTATCTTGCGCTTGTTTAGCCNTGAACCAGGAGCAACAGG 
GTCAGNTTNTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAAT 
NTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTNTGGCTAGCTGTGTTAA 
AAGTGACCACTNGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGA 
GATTTGTTGGTGGCATTGGCCTGTTNTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACC 
TACAGATACAGGAACCAG 
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AATCCCAAATTCCCCAATTTTTTTGGNCTTTTTAGGGAAAGATGTGTTGTGGTAAAAAGTGT 
TAGTATAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAATAG 
TTATGTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTG 
GTTTCATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCCATCAGAATGGAACG 
AGTTTTGAGTAATCCAGGAAGTATATCTATATGATCTTGATATTGTTTTATATAATTTGAAG 
TCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGAT 
ATTTGATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAAGTTCTCAGTATTGTAACAGC 
AACTTGTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATT 
GTGTGGAGGAAATGGCAATCTTATGTGTGCTGAAGGACACAGTAAGAGCACCAAGTTGTGCC 
CCACTTGC 



FIGURE 18 

ATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTATGGTATCTTGCGCTTGTTTAGCCC 
CTGAAACCAGGAGCAACAGGGNWCAGCTTCCTGGAGGTTGGTTGGCAACAATCACGGCCAAG 
TGACTCCGCAAATGACATCCCAGAGAAATCCTAAACTGCTGTGGGTTCCGAAGTGTTAACCC 
AAATGACACCTGTCTGGCTNGCTGTGTTAAAAGTGACCACTCGTGCTCGCCATGTGCTCCAA 
TCATAGGAGAATATGC 



FIGURE 19 



CAGTCACCATGAAGCTGGGCTGTGTCCTCATGGCCTGGGCCCTCTACCTTTCCCTTGGTGTG 
CTCTGGGTGGCCCAGATGCTACTGGCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGT 
CTGCACTGAGGAGAGCAGCTGCCACACGGAGGATGACTTGACTGATGCAAGGGAAGCTGGCT 
TCCAGGTCAAGGCCTACACTTTCAGTGAACCCTTCCACCTGATTGTGTCCTATGACTGGCTG 
ATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAGGC 
CTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCC 
CCGGGCCTAACAGGGAATTCTCCATCACCGTGGTACAAAAGGCAGACAGCGGGCACTACCAC 
TGCAGTGGCATCTTCCAGAGCCCTGGTCCTGGGATCCCAGAAACAGCATCTGTTGTGGCTAT 
CACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTACCCTCAGCTGAACCCCAAG 
CAGGAAGCCCCATGACCCTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGCCCGC 
CTCCTCTTCTCCTTCTACAAGGATGGAAGGATAGTGCAAAGCAGGGGGCTCTCCTCAGAATT 
CCAGATCCCCACAGCTTCAGAAGATCACTCCGGGTCATACTGGTGTGAGGCAGCCACTGAGG 
ACAACCAAGTTTGGAAACAGAGCCCCCAGCTAGAGATCAGAGTGCAGGGTGCTTCCAGCTCT 
GCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGCTGCTCCAGGAACTGCTCCTGA 
GGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCTTTTCTT 
CTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACAT6 
CAGGATGTGAGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCA 
GAAGCCTGGGACCACAAAGGCTACTGCTGAATAGAAGTAAACAGTTCATCCATGATCTCACT 
TAACCACCCCAATAAATCTGATTCTTTATTTTCTCTTCCTGTCCTGCACATATGCATAAGTA 
CTTTTACAAGTTGTCCCAGTGTTTTGTTAGAATAATGTAGTTAGGTGAGTGTAAATAAATTT 
ATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTC 
TGCTGTCTAGATCAGGAATTTCTATCTGTTATATCGACCAGAATGTTGTGATTTAAAGAGAA 
CTAATGGAAGTGGATTGAATACAGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACA 
TTGGGCAATGTTTGGAGACATTTTGGTCATTATACTTGGGGGGTTGGGGGATGGTGGGATGT 
GTGTCTACTGGCATCCAGTAAATAGAAGCCAGGGGTGCCGCTAAACATCCTATAATGCACAG 
GGCAGTACCCCACAACGAAAAATAATCTGGCCCAAAATGTCAGTTGTACTGAGTTTGAGAAA 
CCCCAGCCTAATGAAACCCTAGGTGTTGGGCTCTGGAATGGGACTTTGTCCCTTCTAATTAT 
TATCTCTTTCCAGCCTCATTCAGCTATTCTTACTGACATACCAGTCTTTAGCTGGTGCTATG 
GTCTGTTCTTTAGTTCTAGTTTGTATCCCCTCAAAAGCCATTATGTTGAAATCCTAATCCCC 
AAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAGTGCAGAGCCCTC 
ATGATTAGGATTAGTGCCCTTATTTAAAAAGGCCCCAGAGAGCTAACTCACCCTTCCACCAT 
ATGAGGACGTGGCAAGAAGATGACATGTATGAGAACCAAAAAACAGCTGTCGCCAAACACCG 
ACTCTGTCGTTGCCTTGATCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGG 
TTGTTTGTAGCCTAA 



TTTIf 



FIGURE 20 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ ss .DNA40594 
xsubunit 1 of 1, 359 aa, 1 stop 
xMW: 38899, pi: 5.21, NX(S/T): 0 

MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQV 
KAYTFSEPPHLIVSYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGP 
NREFSITWQKADSGHYHCSGIFQSPGPGIPETASWAITVQELFPAPILRAVPSAEPQAGS 
PMTLSCQTKLPLQRSAARLLFSFYKDGRIVQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQ 
VWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPL 
GMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHQKPGTTKATAE 

Signal sequence: 

amino acids 1-17 

Leucine zipper pattern sequence: 

amino acids 12-33 

Protein kinase C phosphorylation site: 

amino acids 353-355 



rum 



FIGURE 21 



CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAGTT 
TGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAa.GTAGCTCTGGCTGTGATGGGG 
ATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGTCCCATCCT 
GGAAGTGCCAGAGAGTGTAACAGGACCTTGGAAAGGGGATGTGAATCTTCCCTGCACCTATG 
ACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGTGGCTCAGACCCT 
GTCACCATCTTTCTACGTGACTCTTCTGGAGACCATATCCAGCAGGCAAAGTACCAGGGCCG 
CCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCTCCAATTGA6CACCCTGGAGATGG 
ATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGATGGCAACCAAGTCGTG 
AGAGATAAGATTACTGAGCTCCGT6TCCAGAAACTCTCTGTCTCCAAGCCCACAGTGACAAC 
TGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGGATTAGCCTTCAATGCCAGGCTC 
GGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAACAGACTAATAACCAGGAACCCATC 
AAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCTGCGGTGATAGCCGACTCAGGCTCCTA 
TTTCTGCACTGCCAAGGGCCAGGTTGGCTCTGAGCAGCACAGCGACATTGTGAAGTTTGTGG 
TCAAAGACTCCTCAAAGCTACTCAAGACCAAGACTGAGGCACCTACAACCATGACATACCCC 
TTGAAAGCAACATCTACAGTGAAGCAGTCCTGGGACTGGACCACTGACATGGATGGCTACCT 
TGGAGAGACCAGTGCTGGGCCAGGAAAGAGCCTGCCTGTCTTTGCCATCATCCTCATCATCT 
CCTTGTGCTGTATGGTGGTTTTTACCATGGCCTATATCATGCTCTGTCGGAAGACATCCCAA 
CAAGAGCATGTCTACGAAGCAGCCAG GTAAG AAAGTCTCTCCTCTTCCATTTTTGACCCCGT 
CCCTGCCCTCAATTTTGATTACTGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGACCC 
AATCCTAAGGCCGGAGGCCTTCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTT 
CTGAGGTTGTTTTGGCCCTCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCC 
AGAATCCCTGGGTGGTAGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGA 
AACCAGGACCACAGCCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACA 
TGCCAGAGAGGCCAACGACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 
CCAGTGATGAGCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGA 
CAGGAGTACCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCC 
TCTGGATTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGC 
CAGGATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCT 
ACCTCTCTTCCTGGATAGCCCAAAGTGTCCGCCTACCAACACTGGAGCCGCTGGGAGTCACT 
GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCTGG 
GCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAAGATG 
CCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATCTTGCCA 
CCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCCAGGATCAT 
TTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCACAGGCCAGGGTTCA 
GTTCTGCTCCTCCACTATAAGTCTAATGTTCTGACTCTCTCCTGGTGCTCAATAAATATCTA 
ATCATAACAGC 



FIGURE 22 



>< /usr / seqdb2 / s st /DNA/Dnaseqs . min/ ss . DNA4 54 16 
xsubunit 1 of 1, 321 aa, 1 stop 
xMW: 35544, pi: 8.51, NX{S/T): 0 

MGILLGLLLLGHLTVDTyGRPILEVPESVTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGS 
DPVTIFLRDSSGDHIQQAKYQGRLHVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTPDGNQ 
VVRDKITELRVQKLSVSKPTVTTGS6YGFTVPQGMRISLQCQARGSPPISYIWYKQQTNNQE 
PIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKFWKDSSKLLKTKTEAPTTMT 
YPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAIILIISLCCMWFTMAYIMLCRKT 
SQQEHVYEAAR 

Signal Sequence: 

amino acids 1-19 

Glycosaiainoglycan attachment site: 

amino acids 149-152 

Transmembrane domain: 

amino acids 282-300 



FIGURE 23 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGGATGTCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTCCTCTTGGTGTGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGA 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACeCCAGCCAGGGCTGGCCAC 
CCAGGCATACAGCCTAGTGGGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATG 
CTAATCTGACCAAAGCAGAAACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAA 
ACGGTCTGAATTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TTCTCGTCATTGGAGCTCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCA 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGATGTAAGCTGATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTGAATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAGCCGGGCA 
TGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 



FIGURE 24 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45419 
xsubunit 1 of 1, 373 aa, 1 stop 
xMW: 41281, pi: 8.33, NX(S/T): 3 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWI.LTDNEGNQKV 
VITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHV 
ILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMS YSGLYQCTAGNEAGKESCWRVTVQYVQS IGMVAGAVTGI VAGALL I 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGIATQAYSLVGPEWGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 232-251 



FIGURE 25 

GTCGTTCOTTTGCrCTCTCGCGCCCAGTCCTCCTCCCTGGTTCTCCTCAGCCGCTGTCGGAGGAGAGCACCCGQA 

GACGCGGGCTGCAGTCGCGGCGGCTTCTCCCCGCCTGGGCGGCCTCGCCGCTGGGCAGGTGCTGAGCGCCCCTAG 

AGCCTCCCTTGCCGCCTCCCTCCTCTGCCCGGCCGCAGCAGTGCACATGGGGTGTTGGAGGTAGATGGGCTCCCG 

GCCCGGGAGGCGGCGGTGGATGCGGCGCTGGGCAGAAGCAGCCGCCGATTCCAGCTGCCCCGCGCGCCCCGGGCG 

CCCCTGCGAGTCCCCGGTTCAGCCATGGGGACCTCTCCGAGCAGCAGCa^CCGCCCTCGCCTCCTGCAGCCGCATC 

GCCCGCCGAGCCACAGCCACGATGATCGCGGGCTCCCTTCTCCTGCTTGGATTCCTTAGCACCACCACAGCTCAG 

CCa^GAAaW3AAGGCCTCGAATCTaiTTGGCACM:ACCGCCATGTTGACCGTGCC».CCGGC(^ 

GACAAGTGTCCAGCaGGAACCTATGTCTCTGAGCATTGTACCAACACAAGCCTGCGCGTCTGCAGCAG 

GTGGGGACCTTTACCAGGCATGAGAATGGCATAGAGAAATGCCa.TGACTGTAGTCAGCa^TGCCCATG^ 

ATTGAGAAATTACCTTGTGCTGCCTTGACTGACCGAGAATGCACTTGCCCACCTGGCATGTTCCAGTCTAACGCT 

ACCTGTGCCCCCCATACGGTGTGTCCTGTGGGTTGGGGTGTGCGGAAGAAAGGGACAGAGACTGAGGATGTGCGG 

TGTAAGCAGTGTGCTCGGGGTACCTTCTCAGATGTGCCTTCTAGTGTGATGAAATGCAAAGCATACACAGACTGT 

CTGAGTCAGAACCTGGTGGTGATCAAGCCGGGGACCAAGGAGACAGACAACGTCTGTGGCACa.CTCCCGTCCTTC 

TCCAGCTCCACCTCACCTTCCCCTGGCACaGCCATCTTTCCa^CGCCCTGAGCACATGGAAACCCATGAA 

TCCTCCACTTATGTTCCCaj^GGCATGAACTCaACa.GAATCCAACTCTTCTGCCTCTGTTAGACCAAAGGT^ 

AGTAGCATCCAGGAAGGGACAGTCCCTQAO^CACAAGCTCy^GCAAGGGGGAAGQAAGACGTGAACAAGAC^ 

Ca^CCTTCAGGTAGTCS^CCACCAGCaAGGCCCCCACCACAGACaCATCCTGa^ 

GCCACTGGGGGCGAGAAGTCC7^GCACGCCCATCAAGGGCCCCaAGAGGGGACATCCTAGACAGAACCTAC».CAAG 

Cy^TTTTGACATOf^TGAGCATTTGCCCTGGATGATTGTGCTTTTCCTGCTGCrGGTGCTTGTGGTGATTGTGGTG 

TGCAGTATCCGGAAAAGCTCQAGGACTCTGAAAAAGGGGCCCCGGCAGGATCCCAGTGCCATTGTGGAAAAGGCA 

GGGCTGAAGAAATCCATGACTCCAACCCAGAACCGGGAGAAATGGATCTACTACTGCAATGGCCATGGTATCGAT 

ATCCTGAAGCTTGTAGCAGCCCAAGTGGGAAGCCAGTGGAAAGATATCTATCAGTTTCTTTGCAATGCCAGTGAG 

AGGGAGGTTGCTGCTTTCTCCAATGGGTACaCAGCCGACCACGAGCGGGCCTACGCAGCTCTGCAGCACTGGACC 

ATCCGGGGCCCCGAGGCCAGCCTCGCCCAGCTAATTAGCGCCCTGCGCCAGCACCGGAGAAACGATGTTGTGGAG 

AAGATTCGTGGGCTGATGGAAGACACCACCCAGCTGGAAACTGACAAACTAGCTCTCCCGATGAGCCCCAGCCCG 

CTTAGCCCGAGCCCCATCCCCAGCCCCAAC6CGAAACTTGAGAATTCCGCTCTCCTGACGGTGGAGCCTTCCCCA 

CaGGAO^GAACAAGGGCTTCTTCGTGGATGAGTCGGAGCCCCTTCTCCGCTGTGACTCTACATCCAGCGGCTC^ 

TCCGCGCTGAGOiGGAACGGTTCCTTTATTACCAAAGAAAAGAAGGACACAGTGTTGCGGCAGGTACGCCT 

CCCTGTGACTTGCAGCCTATCTTTGATGACAT6CTCCACTTTCTAAATCCTQAGGAGCTGCGGGTGATTGAAQAG 

ATTCCCCAGGCTGAGGACAAACTAGACCGGCTATT(XAAATTATTGGAGTCaAGAGCCAGGAAGCCAGCCA 

CTCCTGGACTCTGTTTATAGCC:a.TCTTCCTGACCTGCTG3agAACATAGGGATACTGCy^TTCTGGAA^ 

ATTTAGTGGCAGGGTGGTTTTTTAATTTTCTTCTGTTTCTGATTTTTGTTGTTTGGGGTGTGTGTGTGTGTTTGT 




ATACCCACCyiCTAAAGTTTTTTAAGTTCCATATTTTCTCCATTTTGCCTTCTTATGTATTTTCAAGATTATTCTG 

TGCACTTTAAATTTACTTAACTTACCS^TAAATGCAGTGTGACTTTTCCCaiCACyiCTGGATTGTGAGGCTCTTA^^ 

TTCTTAAAAGTATAATGGCATCTTGTGAATCCTATAAGCAGTCTTTATGTCTCTTAACATTCACACCTACTTTTT 

AAAAAO^TATTATTACTATTTTTATTATTGTTTGTCCTTTATAAATTTTCTTAAAGATTAAGAAAATTTAAGA 

CCCCATTGAGTTACTGTAATGCAATTCAACTTTGAGTTATCTTTTAAATATGTCTTGTATAGTTCATATTCATGG 

CTGAAACTTGACCACACTATTGCTGATTGTATGGTTTTCACCTGGACACCGTGTAGAATGCTTGATTACTTGTAC 

TCTTCTTATGCTAATATGCTCTGGGCTGGAGAAATGAAATCCTCAAGCCATCAGGATTTGCTATTTAAGTGGCTT 

GACAACTGGGCCaCCyiAAGAACTTGAACTTCaCCTTTTAGGATTTGAGCTGTTCTGGAACACATTGCTGCACT^ 

GGAAAGTCAAAATCAAGTGCCAGTGGCGCCCTTTCCATAGAGAATTTGCCCa.GCTTTGCTTTAAAAGATGTCTTG 

TTTTTTATATACACATAATO^TAGGTCCaATCrGCTCTCaAGGCCTTGGTCCTGGTGGGAT^ 

ACTTTAATTAAAAATGGCTGCAACTGTAAGAACCCTTGTCTGATATATTTGCAACTATGCTCCCATTTACAAATG 

TACCTTCTAATGCTCAGTTGCCAGGTTCa^TGaWVGGTGGCGTGGACTCCCTTTGTGTGGGTGGGGTTTGTG^ 

GTAGTGGTGAAGGACCGATATCAGAAAAATGCCTTCy^GTGTACTAATTTATTAATAAACATTAGGTGTTTGTTA 

AAAAAAAAA 



FIGURE 26 



>< /usr/ seqdb2 / s s t /DNA/Dnaseqs . min/ ss . DNA5 2 5 94 
xsiibunit 1 of 1, 655 aa, 1 stop 
xMW: 71845, pi: 8.22, NX(S/T}: 8 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQKASNLIGTYRHVDRATG 
QVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIEKCHDCSQPCPWPMIEKLPCA 
ALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTETEDVRCKQCARGTFSDVPSSVMKC 
KAYTDCLSQNLWIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHEVPSSTYVP 
KGMNSTESNSSASVRPKVLSSIQEGTVPDNTSSARGKEDVNKTLPNLQVWHQQGPHHRHIL 
KLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLLVLWIWCSIRK 
SSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIY 
QFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLISALRQHRRNDWEKIRG 
LMEDTTQLETDKLALPMS PSPLSPSPIPS PNAKLENS ALLTVE PS PQDKNKGFFVDE S EPLL 
RCDSTSSGS S ALSRNGS F I TKE KKDTVLRQVRLDPCDLQP I FDDMLHFLNPEELRVI E E I PQ 
AEDKLDRLFE I IGVKSQEASQTLLDSVYSHLPDLL 

Signal sequence: 
amino acids 1-41 

Transmembrane domain: 

amino acids 350-370 



FIGURE 27 

ATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGA 
CTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCAC 
CATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 
TTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGCTGTTGCTGCACAGATCCTG 
TCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCATTGGGATCATTGCATTGATATTAGC 
ACTGGCCATTGGTCTGGGCATCCACTTCGACTGCTCAGGGAAGTACAGATGTCGCTCATCCT 
TTAAGTGTATCGAGCTGATAGCTCGATGTGACGGAGTCTCGGATTGCAAAGACGGGGAGGAC 
GAGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTC 
GTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAAC 
TGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTC 
CGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCATTACACCA 
CTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCT 
GTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAG 
TGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCAC 
GCCCCTGTGGATCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGA 
CCATCCAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAG 
ATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCT 
GGCCGGGCCACTCACGTTCAATGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGA 
ACTTCCCCGATGGAAAAGTGTGCTGGACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGAC 
GCCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACA6 
GGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTGGCG 
TGGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAAG 
TTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGGTGTACAC 
CCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAAAAACCTGAA 
GAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCCGATCCTCCCCT 
GGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTCCGGCACCA 
GTAGCAGGCCCGAAAGAGGCACCCTTCCATCTGATTCCAGCACAACCTTCAAGCTGCTTTTT 
GTTTTTTGTTTTTTTGAGGTGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGAAA 
TCCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATTCTCTTGCCTCAGCTTCCCCA 
GTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAATTTTTGTATTTTTAGTAGAGAC 
AGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCCTGACCTCAAATGATGTGCCTGCTT 
CAGCCTCCCACAGTGCTGGGATTACAGGCATGGGCCACCACGCCTAGCCTCACGCTCCTTTC 
TGATCTTCACTAAGAACAAAAGAAGCAGCAACTTGCAAGGGCGGCCTTTCCCACTGGTCCAT 
CTGGTTTTCTCTCCAGGGTCTTGCAAAATTCCTGACGAGATAAGCAGTTATGTGACCTCACG 
TGCAAAGCCACCAACAGCCACTCAGAAAAGACGCACCAGCCCAGAAGTGCAGAACTGCAGTC 
ACTGCACGTTTTCATCTCTAGGGACCAGAACCAAACCCACCCTTTCTACTTCCAAGACTTAT 
TTTCACATGTGGGGAGGTTAATCTAGGAATGACTCGTTTAAGGCCTATTTTCATGATTTCTT 
TGTAGCATTTGGTGCTTGACGTATTATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCT 
CATTGTCTGGCGTGTCTGCGTGGACTGGTGACGTGAATCAAAATCATCCACTGAAA 



FIGURE 28 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45234 
xsLibunit 1 of 1, 453 aa, 1 stop 
xMW: 49334, pi: 6.32, NX{S/T): 1 

MGENDPPAVEAPFSFRSLFGLDDLKISPVAPDADAVAAQILSLLPLKFFPI IVIGI lALILA 
LAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAAS 
WKTMCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHH 
SVYVREGCASGHWTLQCTACGHRRGYSSRIVGGlSnyiSLLSQWPWQASLQFQGYHLCGGSVIT 
PLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMKL 
AGPLTFNEMIQPVCLPNSEENFPDGKVCWTSGWGATEDGGDASPVLNHAAVPLISNKICNHR 
DVYGGI I SPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVNKPGVYT 
RVTSFLDWIHEQMERDLKT 

Signal Peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 240-284 



FIGURE 29 



CCCACGCGTCCGTCCTAGTCCCCGGGCCAACTCGGACAGTTTGCTCATTTATTGC^UkCGGTCaAGGCTGGCTTGT 

GCCaGAACGGCGCGCGCGCGCGCACGCAOSCaCACACACGGGGGQZU^CTTTTTTAAAaATGAAAGG 

GCTCAGCGGCGGCGCGGGCGCTGCGCGAGGGCTCCGGAGCTGACTCGCCGAGGCAGGAAATCCCTCCGGTCGCGA 

CGCCCGGCCCCGGCTCGGCGCCCGCGTGGGATGGTGCAGCGCTCGCCGCCGGGCCCGAGAGCTGCTGCACTGAAG 

GCCGGCGACGATGGCAGCGCGCCCGCTGCCCGTGTCCCCCGCCCGCGCCCTCCTGCTCGCCCTGGCCGGTGCTCT 

GCTCGCGCCOTGCGAGGCCCGAGGGGTGAGCTTATGGAACCAAGGAAGAGCTGATGJ^GTTGTCaiGTGCCTCTGT 

TCGGAGTGGGGACCTCTGGATCCCAGTGAAGAGCTTCGACTCCAAGAATCATCCAGAAGTGCTGAATATTCGACT 

ACS^CGGGAAAGCyiAAGAACTGATCATAAATCTGGAAAGAAATGAAGGTCTCaTTGCCAGCA.GTTTCAC 

CCACTATCTGCAAGACGGTACTGATGTCTCCCTCGCTCGAAATTACACGGGTCACTGTTACTACCATGGACATGT 

ACGGGGATATTCTGATTCAGCAGTCAGTCTCAGCACGTGTTCTGGTCrCA.GGGGACTTATTGT6TTT6AAAATGA 

AAGCTATGTCTTAGAACCaATGAAAAGTGCAACCAACAGATACAAACTCTTCCCAGCGAAGAAGCTGAA 

CCGGGGATCATGTGGATCACaT<::aCAACACACCaAA.CCTCGCTGCAAAGAATGTGTT 

ATGGGG?^GAAGGCATAAAAGAGAGACCCTCAAGGCAACTAAGTATGTGGAGCTGGTGATCGTGGCAGACffi^ 

AGAGTTTCa.GAGG(:a^GGAAAAGATCTGGAAAAAGTTAAGCAGCGATTAATAGAGATTGCTAATCACGTTGAC^ 

GTTTTACAGACCACTG^CATTCGGATCGTGTTGGTAGGCGTGGAAGTGTGGAA.TGACATGGACa^ 

AAGTCAGGACCCATTCACCaGCCTCCATGAATTTCTGGACTGGAGGAAGATG2^GCTTCTACCTCGaUi^ 

TGACAaTGCGC:AGCTTGTCAGTGGGGTTTATTTCCAAGGGACCa.CCATCGGa^TGGCCCCA^ 

CACGGCAGACCAGTCTGGGGGAATTGTC:a.TGGACCATTCaGACAATCCCCTTGGTGCAGCCGT^ 

TGAGCTGGGCCACAATTTCGGGATGAATCATGACACACTGGACaGGGGOTGTAGCTGTCa^ 

AGGAGGCTGCa^TCy^TGAACGCTTCCaCCGGGTACCCATTTCCCATGGTGTTCyiGCAGTTGC^ 

GGAGACCAGCCTGGAGAAAGQAATGGGGGTGTGCCTGTTTAACCTGCCGGAAGTCAGGGAGTCTTTCGGGGGCCA 

GAAGTGTGGGAACAGATTTGTGGaAGAAGGAGAGGAGTGTGACTGTGGGGAGCCAGAGGAATGTATGAATCGCTG 

CTGCAATGCCACCACCTGTACCCTGAAGCCGGACGCTGTGTGCGCACATGGGCTGTGCTGTGAAGACTGCCAGCT 

GAAGCCTGCa.GGAACAGCGTGCAGGGACTCC:aGCAACTCCTGTGACCTCCCAQAGTTCTGCaLCAGGGGCCAGCCC 

TCACTGCCCAGCCAATGTGTACCTGCACGATGGGCACTCATGTCAGGATGTGaACGGCTACTGCTACAATGGCAT 

CTGCCAGACTCACGAGCAGCAGTGTGTCACGCTCTGGGGACCAGGTGCTAAACCTGCCCCTGGGATCTGCTTTGA 

GAGAGTCTiATTCTGCAGGTGATCCTTATGGCAACTGTGGaU^GTCTCGAAQAGTTCCTTTGCC^^ 

GAGAQATGCTAAATGTGGAAAAATCCAGTGTCAAGGAGGTGCCAGCCGGCCAGTCATTGGTACCaATGCCGTT^ 

CATAGAAACaAACATCCCTCTGCy^GCAAGGAGGCCGGATTCTGTGCCGGGGGACCCACGTGTACTTGGGCGATGA 

CATGCCGGACCCAGGGCTTGTGCTTGCAGGCACAAAGTGTGCAGATGGAAAAATCTGCCTGAATCGTCAATGTCA 

AAATATTAGTGTCTTTGGGGTTO^CGAGTGTGCaATGCAGTGCCACGGCAGAGGGGTGTGaiAC^ 

CTGCCACTGCGAGGCCCACTGGGCACCTCCCTTCTGTGACAAGTTTGGCTTTGGAGGAAGCACAGACAGCGGCCC 

CATCCGGCAAGCAGAAGCaAGGCAGGAAGCTGCAGAGTCCAACaGGGAGCGCGGCCAGGGCCa 

ATCGCAGGAGCATGCGTCTACTGCCTCACTGACACTCATCTGAGCCCTCCCATGACATGGAGACCGTGACCAGTG 

CTGCTGCAGAGGAGGTCACGCGTCCCCaAGGCCTCCTGTGACTGGCAGCATTGACTCTGTGGCTTTGCCATCGTT 

TCCATGACAACAGACSiCaACACAGTTCTCGGGGCTCAGGAGGGGAAGTCCAGCCTACa^GG^ 

CAGTGCAAGGAAGGGCAGCGACTTCCTGGTTGAGCTTCTGCTAAAACATGGACATGCTTCAGTGCTGCTCCTGAG 

AGAGTAGCAGGTTACC3VCTCTGGCAGGCCCCAGCCCTGCAGCAAGGAGGAAGAGGACTCAAAAGTCTGGCCTTTC 

ACTGAGCCTCCAO^GCa^GTGGGGGAGAAGCAAGGGTTGGGCCCAGTGTCCCCTTTCCCCSlGTGACaCCTCAGCCT 

TGGCa^GCCCTGATGACTGGTCTCTGGCTGCAACTTAATGCTCTGATATGGCTTTTAGCATTTATTATATGAAAAT 

AGCy^GGGTTTTAGTTTTTAATTTATCAGAGACCCTGCCACCCaTTCCATCTCCATCCAAGCAAACTGAATGG 

TGAAACAAACTGGAGAAGAAGGTAGGAGAAAGGGCGGTGAACTCTGGCTCTTTGCTGTGGACATGCGTGACCAGC 

AGTACTCAGGTTTGAGGGTTTGGAGAAAGCCAGGGAACCCaCAQAGTCACCSUVCCCTTCA 

TGTTAAAAAGTGAAAACAATGTAAGAGCCTAACTCCATCCCCCGTGGCCATTACTGCATAAAATAGAGTGCATTT 
GAAAT 



FIGURE 30 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49624 
xsubunit 1 of 1, 735 aa, 1 stop 
xMW: 80177, pi: 7.08, NX(S/T): 5 

MAARPLPVSPARA.LLriAIiAGALIiAPCEARGVSLWNQGRADEVVSASVRSGDLWIPVKSFDSK 
NHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLQDGTDVSLARNYTGHCYYHGHVRG 
YSDSAVSLSTCSGLRGLIVFENESWLEPMKSATNRYKLFPAKKLKSVRGSCGSHHNTPNLA 
AKNVFPPPSQTWARRHKRETLKATKYVELVIVADNREFQRQGKDLEKVKQRLIEIANHVDKF 
YRPLNIRIVLVGVEVWNDMDKCSVSQDPFTSLHEFLDWRKMKLLPRKSHDNAQLVSGVYFQG 
TTIGMAPIMSMCTADQSGGIVMDHSDNPLGAAVTLAHELGHNFGMNHDTLDRGCSCQMAVEK 
GGCIMNASTGYPFPMVFSSCSRKDLETSLEKGMGVCLFNLPEVRESFGGQKCGNRFVEEGEE 
CDCGEPEECMNRCCNATTCTLKPDAVCAHGLCCEDCQLKPAGTACRDSSNSCDLPEFCTGAS 
PHCPANVYLHDGHSCQDVDGYCYNGICQTHEQQCVTLWGPGAKPAPGICFERVNSAGDPYGN 
CGKVSKSSFAKCEMRDAKCGKIQCQGGASRPVIGTNAVSIETNIPLQQGGRILCRGTHVYLG 
DDMPDPGLVIAGTKCADGKICLimQCQMISVFGVHECAMQCHGRGVCNNRKNCHCEAHWAPP 
FCDKFGFGGSTDSGPIRQAEARQEAAESNRERGQGQEPVGSQEHASTASLTLI 



Signal peptide: 

amino acids 1-28 



FIGURE 31 



TCCCAAGGCTTCTTGGATGGCAGATGATTNTGGGGTTTTGCATTGTTTCCCTGACAACGAAA 
ACAAAACAGTTTTGGGGGTTCAGGAGGGGAANTCCAGCCTACCCAGGAAGTTTGCAGAAACA 
GTGCAAGGAAGGGCAGGANTTCCTGGTTGAGNTTTTTGNTAAAACATGGACATGNTTCAGTG 
CTGCTCNTGAGAGAGTAGCAGGTTACCACTTTTGGCAGGCCCCAGCCCTGCAGCAAGGAGGA 
AGAGGACTCAAAAGTTTGGCCTTTCACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTT 
GGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCTTGGCAGCCCTGATAACTGGTNTNT 
GGCTGCAANTTAATGCTNTGATATGGCTTTTAGCATTTATTATATGAAAATAGCAGGGTTTT 
AGTTTTTAATTTATCAGAGACCCTGCCACCCATTCCATNTCCATCCAAG 



FIGURE 32 



CATCCTGCAACATGGTGAAACCACGCCTGGCTAATTTTGTTGTATTTTTGGTAGAGATGGGA 
TTTCACCGTGTTAGCCAGGATTGTCTCAATCTGACCTCATGATCTGCCCGCCTCGGCCTCCC 
AAAGTGCTGGGATTACAGGCGAGTGCAACCACACCCGGCCACAAACTTTTTAAGAAGTTAAT 
GAAACCATACCTTTTACATTTTTAATGACAGGAAAATGCTCACAATAATTGTTAACCCAAAA 
TTCTGGATACAAAAGTACAATCTTTACTGTGTAAATACATGTATATGTACTATATGAAAATA 
TACCAAATATCAATAATACTTATCTCTGGGTAAAAACCTCTTCTCATACCCTGTGCTAACAA 
CTTTTAACAAAAAATTTGCATCACTTTTAAGAATCAAGAAAAATTTCTGAAGGTCATATGGG 
ACAGAAAAAAAAACCAAGGGAAAAATCACGCCACTTGGGAAAAAAAGATTCGAAATCTGCCT 
TTTTATAGATTTGTAATTAATAAGGTCCAGGCTTTCTAAGCAACTTAAATGTTTTGTTTCGA 
AACAAAGTACTTGTCTGGATGTAGGAGGAAAGGGAGTGATGTCACTGCCATTATGATGCCCC 
TTGAATATAAGACCCTACTTGCTATCTCCCCTGCACCAGCCAGGAGCCACCCATCCTCCAGC 
ACACTGAGCAGCAAGCTGGACACACGGCACACTGATCCAAATGGGTAAGGGGATGGTGGCGA 
TGCTCATTCTGGGTCTGCTACTTCTGGCGCTGCTCCTACCCGTGCAGGTTTCTTCATTTGTT 
CCTTTAACCAGTATGCCGGAAGCTACTGCAGCCGAAACCACAAAGCCCTCCAACAGTGCCCT 
ACAGCCTACAGCCGGTCTCCTTGTGGTCTTGCTTGCCCTTCTACATCTCTACCATTAAGAGG 
CAGGTCAAGAAACAGCTACAGTTCTCCAACCCATACACTAAAACCGAATCCAAATGGTGCCT 
AGAAGTTCAATGTGGCAAGGAAAAAAACCAGGTCTTCATCAAATCTACTAATTTCACTCCTT 
ATTAACAGAGAAACGCTTGAGAGTCTCAAACTGGACTGGTTTAAAGAGCATCTGAAGGATTT 
GACTAGATGATAAATGCCTGTACTCCCAGTACTTTGGGAGGCCTAGGCCGGCGGATCACCTG 
AGGTCAGGAGTTTGAGACTAACCTGGCCAAAATGGTGAAACCCCATCTGTACTAAAAATACA 
AATATTGACTGGGCGTGGTGGTGAGTGCCTGTGATCCCAGCTACTCAGGTGGCTGAAGCAGG 
ACAATCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCGCGCTACTGCACTCTA 
GCCTAGCCTGGGCAACAGAGTGAGACTTCGTCTCAAAAAAAAAAAAGCCAAGTGCAGTGGCT 
CACGCCTGTAATCCCGGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCA 
AGACCATCCTGGCTAATACAGTGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGG 
GATGGTGGCAGGCACCTGGAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATAGCGTGAA 
CTCAGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCTACTGCACTCCAGCCTGGGCGACAG 
CGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAZy^ 



FIGURE 33 

>< / usr/ seqdb2 / ss t /DNA/Dnaseqs . min/ss . DNA4 8309 
xsubunit 1 of 1, 67 aa, 1 stop 
xMW: 6981, pi: 7.47, K!X(S/T): 0 

MGKGMVAMLILGLLLLALLLPVQVSSFVPLTSMPEATAAETTKPSNSALQPTAGLLWLLAL 
LHLYH 

Signal peptide: 

amino acids 15-27 



FIGURE 34 



GCCGCGGCGAGAGCGCGCCCAGCCCCGCCGCGATGCCCGCGCGCCCaGGACGCCTCCTCCCGCTGCTGGCCCGGC 

CGGCGGCCCTGACTGCGCTGCTGCTGCTGCTGCTGGGCCATGGCGGCGGCGGGCGCTGGGGCGCCCGGGCCCAGG 

AGGCGGCGGCGGCGGCGGCGGACGGGCCCCCCGCGGCAGACGGCGAGGACGGACAGGACCCGCACAGC^ 

TGTACACGGCCGACATGTTCACGC^ICGGGATCCAGAGCGCCGCGCACTTCGTCATGTTCTTCGCGCCCTGGTGTG 

GACAOTGCCaGCGGCTGCAGCCGACTTGGAATGACCTGGGAGACAAATACAACaGCSlTGGAA^ 

ATGTGGCTAAAGTGGACTGCACGGCCCACTCCGACGTGTGCTCCGCCCAGGGGGTGCGAGGATACCCCACCTTAA 

AGCTTTTCaAGCCAGGCCAAGAAGCTGTGAAGTACCAGGGTCCTCGGGACn'TCCaGACACTGGAaAACTGGATGC 

TGCAGACACTGAACGAGGAGCCAGTGACACCaiGAGCCGGAAGTGGAACCGCCCAGTGCCCCCGAGCTCAAGC^ 

GGCTGTATGAGCTCTCAGCaAGaU^CTTTGAGCTGCACGTTGCAO^GGCGACCaOTTTATC^ 

CGTGGTGTGGTCACTGCAAAGCCCTGGCTCCAACCTGGGAGCAGCTGGCTCTGGGCCTTGAACATTCCGAAACTG 

TCmGATTGGO^GGTTGATTGTACACAGCACTATGAACTCTGCTCCGGAAACCyiGGTTCGTGGCTATCCCACTC 

TTCTCTGGTTCCGAGATGGGAAAAAGGTGGATCAGTACAAGGGAAAGCGGGATTTGGAGTCACTGAGGGAGTACG 

TGGAGTCGCa.GCTGCAGCGCACAGAGACTGQAGCGACGGAGACCGTCACGCCCTC:»VGAGGCCCCGGTGCTGG(:a.G 

CTGAGCCCGAGGCTGACaAGGGCACTGTGTTGGCACTCa.CTGZU^TAACTTCGATGACACCATTGCSVGAAGGAA 

TAACCTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTGGGAGGAACTCTCTA 

AAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCGAAGTAGACTGaVCTGCTGAACGGAATATCTGCAGCA 

AGTATTCGGTACGAGGCTACCCCaCGTTATTGCTTTTCCGAGGAGGGiyiGAAAGTCAGTGAGCACAGTGGAGGCA 

GAGACCTTGACTCGTTACACCGCTTTGTCCTGAGCCAAGCGAAAGACGAACTTTAGGAACACAGTTGGAGGTCAC 

CTCTCCrGCCCaGCTCCCGCa.CCCTGCGTTTAGQAGTTC::AGTCCCACa.GAGGCCACTGGGTTCC{:A^ 

GTTCAGAAAGCAGAACATACTAAGCGTGAGGTATCTTCTTTGTGTGTGTGTTTTCCAAGCCAAaVCACTCTA^ 

ATTCTTTATTAAGTTAAGTTTCTCTAAGTAAATGTGTAACTGATGGTO^CTGTGTAAACSITTTTCAGTGGCGATA 

TATCCCCTTTGACCTTOTCTTGATGAAATTTACATGGTTTCCTTTGAGACTAAAATAGCGTTGAGGGAAATGAAA 

TTGCTGGACTATTTGTGGCTCCTGAGTTGAGTGATTTTGGTGAAJ^GAAAGCACATCCAAAGCATAGTTTACCTGC 

CCACQAGTTCTG6AAAGGTGGCCTTGTGGCAGTATTGACGTTCCTCTGATCTTAAGGTCACAGTTGACTCAATAC 

TGTGTTGGTCCGTAGCATGGAGCyiGATTGAAATGCAAAAACCCACACCTCTGGAAGATACCTTCACGGCCGCTGC 

TGGAGCTTCTGTTGCTGTGAATACTTCTCTCAGTGTGAGAGGTTAGCCGTGATGAAAGCAGCGTTACTTCTGACC 

GTGCCTGAGTAAGAGAATGCTGATGCCATAACTTTATGTGTCGATACTTGTCAAATCAGTTACTGTTCAGGGGAT 

CCTTCTGTTTCTCa.CGGGGTGAAACATGTCTTTAGTTCCTCATGTTAACACGAAGCCAGAGCCC:aCATGA^ 

TGGATGTCTTCCTTAGAAAGGGTAGGCATGGAAAATTCCACGAGGCTCATTCTCAGTATCTCATTAACTCATTGA 

AAGATTCCAGTTGTATTTGTCACCTGGGGTGACAAGACCAGAC^^GGCTTTCCCSkGGCCTGGGTATCCAGC^ 

TCTGCAGCCCTGCTGAAGGGCCCTAACTAGAGTTCTAGAGTTTCTGATTCTGTTTCTCAGTAGTCCTTTTAGAGG 

CTTGCTATACTTGGTCTGCTTCAAGGAGGTCGACCTTCTAATGTATGAAGAATGGGATGCy^TTTGATCTCa^ 

CAAAGACAGATGTCAGTGGGCTGCTCTGGCCCTGGTGTGCACGGCTGTGGCAGCTGTTGATGCCAGTGTCCTCTA 

ACTCATGCTGTCCTTGTGATTAAACACCTCTATCTCCCTTGGGAATAAGCACATACAGGCTTAAGCTCTAAGATA 

GATAGGTGlTTGTCCTTTTACCTiTCGAGCTACTTCCCATAATAACCACTTTGCATCCAACyiCTCTTCACCC^ 

CCCATACGCAAGGGGATGTGGATACTTGGCCCaU^GTAACTGGTGGTAGGAATCTTAGAAAC^ 

CrGTCTGTCTGAGGCAGaAGATAACAGCAGCATCTCGACCAGCCTCTGCCTTAAAGGAAATCT^ 

TATGGTTC»LCAGATAATTCTTTTTTTAAAAA?^CCCAACCTCCTAGAGAAGCACy^CTGTCAAGAGTCT 

CACAACTTOlGCTTTGCATCACGAGTCTTGTATTCCAAGAAAATCAftAGTGGTACAATTTGT^ 

GATACTTTCTAAATAAACTCTTTTTTTTTAA 



FIGURE 35 

></usr / seqdb2 /ss t /DNA/Dnaseqs . min/ ss . DNA4 6776 
xsubunit 1 of 1, 432 aa, 1 stop 
xMW: 47629, pi: 5.90, NX(S/T): 0 

MPARPGRLLPLLARPAALTALLLLLLGHGGGGRWGARAQEAAAAAADGPPAADGEDGQDPHS 
KHLYTADMFTHGIQSAAHFVMFFAPWCGHCQRLQPTimDLGDKYNSMEDAKVYVAKVDCTAH 
SDVCSAQGVRGYPTLKLFKPGQEAVKYQGPRDFQTLENWMLQTLNEEPVTPEPEVEPPSAPE 
LKQGLYELSASNFELHVAQGDHFIKFFAPWCGHCKALAPTWEQLALGLEHSETVKIGKVDCT 
QHYELCSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGATETVTPSEA 
PVLAAEPEADKGTVLALTENNFDDTIAEGITFIKFYAPWCGHCKTLAPTWEELSKKEFPGLA 
GVKIAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 



Signal sequence: 

amino acids 1-32 



FIGURE 36 



CTTTTCTGAGGAACCACAGCAATGAATGGCTTTGCATCCTTGCTTCGAAGAAACCAATTTAT 
CCTCCTGGTACTATTTCTTTTGCAAATTCAGAGTCTGGGTCTGGATATTGATAGCCGTCCTA 
CCGCTGAAGTCTGTGCCACACACACAATTTCACCAGGACCCAAAGGAGATGATGGTGAAAAA 
GGAGATCCAGGAGAAGAGGGAAAGCATGGCAAAGTGGGACGCATGGGGCCGAAAGGAATTAA 
AGGAGAACTGGGTGATATGGGAGATCAGGGCAATATTGGCAAGACTGGGCCCATTGGGAAGA 
AGGGTGACAAAGGGGAAAAAGGTTTGCTTGGAATACCTGGAGAAAAAGGCAAAGCAGGTACT 
GTCTGTGATTGTGGAAGATACCGGAAATTTGTTGGACAACTGGATATTAGTATTGCTCGGCT 
CAAGACATCTATGAAGTTTGTCAAGAATGTGATAGCAGGGATTAGGGAAACTGAAGAGAAAT 
TCTACTACATCGTGCAGGAAGAGAAGAACTACAGGGAATCCCTAACCCACTGCAGGATTCGG 
GGTGGAATGCTAGCCATGCCCAAGGATGAAGCTGCCAACACACTCATCGCTGACTATGTTGC 
CAAGAGTGGCTTCTTTCGGGTGTTCATTGGCGTGAATGACCTTGAAAGGGAGGGACAGTACA 
TGTCCACAGACAACACTCCACTGCAGAACTATAGCAACTGGAATGAGGGGGAACCCAGCGAC 
CCCTATGGTCATGAGGACTGTGTGGAGATGCTGAGCTCTGGCAGATGGAATGACACAGAGTG 
CCATCTTACCATGTACTTTGTCTGTGAGTTCATCAAGAAGAAAAAGTAACTTCCCTCATCCT 
ACGTATTTGCTATTTTCCTGTGACCGTCATTACAGTTATTGTTATCCATCCTTTTTTTCCTG 
ATTGTACTACATTTGATCTGAGTCAACATAGCTAGAAAATGCTAAACTGAGGTATGGAGCCT 
CCATCATCAAAAAAAAAAAAAAAA 



FIGURE 37 

></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss.DNA50980 
xsubunit 1 of 1, 277 aa, 1 stop 
xMW: 30645, pi: 7.47, NX(S/T): 2 

MNGFASLLRRNQFILLVLFLLQIQSLGLDIDSRPTAEVCATHTISPGPKGDDGEKGDPGEEG 
KHGKVGRMGPKGIKGELGDMGDQGNIGKTGPIGKKGDKGEKGLLGIPGEKGKAGTVCDCGRY 
RKFVGQLDISIARLKTSMKFVKNVIAGIRETEEKFYYIVQEEKNYRESLTHCRIRGGMLAMP 
KDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMSTDNTPLQNYSNWNEGEPSDPYGHEDC 
VEMLS SGRWNDTECHLTMYFVCEFI KKKK 

Signal peptide: 

amino acids 1-25 



FIGURE 38 

GGTTCTATCGATTCGAATTCGGCCACACTGGCCGGATCCTCTAGAGATCCCTCGACCTCGAC 
CCACGCGTCCGCTGCTCTCCGCCCGTGTGGAGTGGTGGGGGCCTGGGTGGGAATGGGCGTGT 
GCCAGCGCACGCGCGCTCCCTGGAAGGAGAAGTCTCAGCTAGAACGAGCGGCCCTAGGTTTT 
CGGAAGGGAGGATCAGGGATGTTTGCGAGCGGCTGGAACCAGACGGTGCCGATAGAGGAAGC 
GGGCTCCATGGCTGCCCTCCTGCTGCTGCCCCTGCTGCTGTTGCTACCGCTGCTGCTGCTGA 
AGCTACACCTCTGGCCGCAGTTGCGCTGGCTTCCGGCGGACTTGGCCTTTGCGGTGCGAGCT 
CTGTGCTGCAAAAGGGCTCTTCGAGCTCGCGCCCTGGCCGCGGCTGCCGCCGACCCGGAAGG 
TCCCGAGGGGGGCTGCAGCCTGGCCTGGCGCCTCGCGGAACTGGCCCAGCAGCGCGCCGCGC 
ACACCTTTCTCATTCACGGCTCGCGGCGCTTTAGCTACTCAGAGGCGGAGCGCGAGAGTAAC 
AGGGCTGCACGCGCCTTCCTACGTGCGCTAGGCTGGGACTGGGGACCCGACGGCGGCGACAG 
CGGCGAGGGGAGCGCTGGAGAAGGCGAGCGGGCAGCGCCGGGAGCCGGAGATGCAGCGGCCG 
GAAGCGGCGCGGAGTTTGCCGGAGGGGACGGTGCCGCCAGAGGTGGAGGAGCCGCCGCCCCT 
CTGTCACCTGGAGCAACTGTGGCGCTGCTCCTCCCCGCTGGCCCAGAGTTTCTGTGGCTCTG 
GTTCGGGCTGGCCAAGGCCGGCCTGCGCACTGCCTTTGTGCCCACCGCCCTGCGCCGGGGCC 
CCCTGCTGCACTGCCTCCGCAGCTGCGGCGCGCGCGCGCTGGTGCTGGCGCCAGAGTTTCTG 
GAGTCCCTGGAGCCGGACCTGCCCGCCCTGAGAGCCATGGGGCTCCACCTGTGGGCTGCAGG 
CCCAGGAACCCACCCTGCTGGAATTAGCGATTTGCTGGCTGAAGTGTCCGCTGAAGTGGATG 
GGCCAGTGCCAGGATACCTCTCTTCCCCCCAGAGCATAACAGACACGTGCCTGTACATCTTC 
ACCTCTGGCACCACGGGCCTCCCCAAGGCTGCTCGGATCAGTCATCTGAAGATCCTGCAATG 
CCAGGGCTTCTATCAGCTGTGTGGTGTCCACCAGGAAGATGTGATCTACCTCGCCCTCCCAC 
TCTACCACATGTCCGGTTCCCTGCTGGGCATCGTGGGCTGCATGGGCATTGGGGCCACAGTG 
GTGCTGAAATCCAAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGCACAGGGTGAC 
GGTGTTCCAGTACATTGGGGAGCTGTGCCGATACCTTGTCAACCAGCCCCCGAGCAAGGCAG 
AACGTGGCCATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAGATACCTGGGAGCGT 
TTTGTGCGGCGCTTCGGGCCCCTGCAGGTGCTGGAGACATATGGACTGACAGAGGGCAACGT 
GGCCACCATCAACTACACAGGACAGCGGGGCGCTGTGGGGCGTGCTTCCTGGCTTTACAAGC 
ATATCTTCCCCTTCTCCTTGATTCGCTATGATGTCACCACAGGAGAGCCAATTCGGGACCCC 
CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGCTGCTGGTGGCCCCGGTAAGCCA 
GCAGTCCCCATTCCTGGGCTATGCTGGCGGGCCAGAGCTGGCCCAGGGGAAGTTGCTAAAGG 
ATGTCTTCCGGCCTGGGGATGTTTTCTTCAACACTGGGGACCTGCTGGTCTGCGATGACCAA 
GGTTTTCTCCGCTTCCATGATCGTACTGGAGACACCTTCAGGTGGAAGGGGGAGAATGTGGC 
CACAACCGAGGTGGCAGAGGTCTTCGAGGCCCTAGATTTTCTTCAGGAGGTGAACGTCTATG 
GAGTCACTGTGCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTTCTGCGTCCCCCC 
CACGCTTTGGACCTTATGCAGCTCTACACCCACGTGTCTGAGAACTTGCCACCTTATGCCCG 
GCCCCGATTCCTCAGGCTCCAGGAGTCTTTGGCCACCACAGAGACCTTCAAACAGCAGAAAG 
TTCGGATGGCAAATGAGGGCTTCGACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGAC 
CAGGCTGTAGGTGCCTACCTGCCCCTCACAACTGCCCGGTACAGCGCCCTCCTGGCAGGAAA 
CCTTCGAATCTGAGAACTTCCACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGGGG 
CCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTATACCAGAACTGCGGTCACTATTTT 
GTAATAAATGTGGCTGGAGCTGATCCAGCTGTCTCTGACCTAAAAAAAA?^AAAAAAAAAAAA 
AAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAGTAGGGATAACAGGGTAATAAGC 
TTGGCCGCCATGGCCCAACTTGTTTATTGCAG 



FIGURE 39 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50913 
xsiibunit 1 of 1, 730 aa, 1 stop 
xMW: 78644, pi: 7.65, NX{S/T): 2 

MGVCQRTRAPWKEKSQLERAALGFRKGGSGMFASGWNQTVPIEEAGSMAALLLLPLLLLLPL 
LLLKLHLWPQLRWLPADLAFAVIlALCCKRALRARALAAAAADPEGPEGGCSIiAWRLAELAQQ 
RAAHTFLIHGSRRFSYSEAERESNRAARAFLRALGWDWGPDGGDSGEGSAGEGERAAPGAGD 
AAAGSGAEFAGGDGAARGGGAAAPLSPGATVALLLPAGPEFLWLWFGLAKAGLRTAFVPTAL 
RRGPLLHCLRSCGARALVLAPEFLES LEPDLPALRAMGLHLWAAGPGTHPAGI SDLIiAEVSA 
EVDGPVPGYLSSPQSITDTCLYIFTSGTTGLPKAARISHLKILQCQGFYQLCGVHQEDVIYL 
ALPLYHMSGSLLGIVGCayiGIGATVVLKSKFSAGQFWEDCQQHRVTVFQYIGEIjCRYLVNQPP 
SKAERGHKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVATINYTGQRGAVGRASW 
LYKHIFPFSLIRYDVTTGEPIRDPQGHCMATSPGEPGLLVAPVSQQSPFLGYAGGPELAQGK 
LLKDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGENVATTEVAEVFEALDFLQEV 
NVYGVTVPGHEGRAGMAALVLRPPHALDLMQLYTHVSENLPPYARPRFLRLQESLATTETFK 
QQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARYSALLAGNLRI 

Type II transmesnbrane domain: 

amino acids 45-65 

Other transmembrane domain: 

amino acids 379-398 

CAMP- and cGMP- dependent protein kinase phosphorylation site 

starting at amino acid 136 

CUB domain protein motif 

amino acids 254-261 

putative AMP -binding domain siganture 

amino acids 332-343 



N-glycosylation sites 

amino acids 37-40 and 483-486 



FIGURE 40 

CCTGTGTTAAGCTGAGGTTTCCCCTAGATCTCGTATATCCCCAACACATACCTCCACGCACA 
CACATCCCCAAGAACCTCGAGCTCACACCAACAGACACACGCGCGCATACACACTCGCTCTC 
GCTTGTCCATCTCCCTCCCGGGGGAGCCGGCGCGCGCTCCCACCTTTGCCGCACACTCCGGC 
GAGCCGAGCCCGCAGCGCTCCAGGATTCTGCGGCTCGGAACTCGGATTGCAGCTCTGAACCC 
CCATGGTGGTTTTTTAAACACTTCTTTTCCTTCTCTTCCTCGTTTTGATTGCACCGTTTCCA 
TCTGGGGGCTAGAGGAGC2^GGCAGCAGCCTTCCCAGCCAGCCCTTGTTGGCTTGCCATCGT 
CCATCTGGCTTATAAAAGTTTGCTGAGCGCAGTCCAGAGGGCTGCGCTGCTCGTCCCCTCGG 
CTGGCAGAAGGGGGTGACGCTGGGCAGCGGCGAGGAGCGCGCCGCTGCCTCTGGCGGGCTTT 
CGGCTTGAGGGGCAAGGTGAAGAGCGCACCGGCCGTGGGGTTTACCGAGCTGGATTTGTATG 
TTGCACCATGCCTTCTTGGATCGGGGCTGTGATTCTTCCCCTCTTGGGGCTGCTGCTCTCCC 
TCCCCGCCGGGGCGGATGTGAAGGCTCGGAGCTGCGGAGAGGTCCGCCAGGCGTACGGTGCC 
AAGGGATTCAGCCTGGCGGACATCCCCTACCAGGAGATCGCAGGGGAACACTTAAGAATCTG 
TCCTCAGGAATATACATGCTGCACCACAGAAATGGAAGACAAGTTAAGCCAACAAAGCAAAC 
TCGAATTTGAAAACCTTGTGGAAGAGACAAGCCATTTTGTGCGCACCACTTTTGTGTCCAGG 
CATAAGAAATTTGACGAATTTTTCCGAGAGCTCCTGGAGAATGCAGAAAAGTCACTAAATGA 
TATGTTTGTACGGACCTATGGCATGCTGTACATGCAGAATTCAGAAGTCTTCCAGGACCTCT 
TCACAGAGCTGAAAAGGTACTACACTGGGGGTAATGTGAATCTGGAGGAAATGCTCAATGAC 
TTTTGGGCTCGGCTCCTGGAACGGATGTTTCAGCTGATAAACCCTCAGTATCACTTCAGTGA 
AGACTACCTGGAATGTGTGAGCAAATACACTGACCAGCTCAAGCCATTTGGAGACGTGCCCC 
GGAAACTGAAGATTCAGGTTACCCGCGCCTTCATTGCTGCCAGGACCTTTGTCCAGGGGCTG 
ACTGTGGGCAGAGAAGTTGCAAACCGAGTTTCCAAGGTCAGCCCAACCCCAGGGTGTATCCG 
TGCCCTCATGAAGATGCTGTACTGCCCATACTGTCGGGGGCTTCCCACTGTGAGGCCCTGCA 
ACAACTACTGTCTCAACGTCATGAAGGGCTGCTTGGCAAATCAGGCTGACCTCGACACAGAG 
TGGAATCTGTTTATAGATGCAATGCTCTTGGTGGCAGAGCGACTGGAGGGGCCATTCAACAT 
TGAGTCGGTCATGGACCCGATAGATGTCAAGATTTCTGAAGCCATTATGAACATGCAAGAAA 
ACAGCATGCAGGTGTCTGCAAAGGTCTTTCAGGGATGTGGTCAGCCCAAACCTGCTCCAGCC 
CTCAGATCTGCCCGCTCAGCTCCTGAAAATTTTAATACACGTTTCAGGCCCTACAATCCTGA 
GGAAAGACCAACAACTGCTGCAGGCACAAGCTTGGACCGGCTGGTCACAGACATAAAAGAGA 
AATTGAAGCTCTCTAAAAAGGTCTGGTCAGCATTACCCTACACTATCTGCAAGGACGAGAGC 
GTGACAGCGGGCACGTCCAACGAGGAGGAATGCTGGAACGGGCACAGCAAAGCCAGATACTT 
GCCTGAGATCATGAATGATGGGCTCACCAACCAGATCAACAATCCCGAGGTGGATGTGGACA 
TCACTCGGCCTGACACTTTCATCAGACAGCAGATTATGGCTCTCCGTGTGATGACCAACAAA 
CTAAAAAACGCCTACAATGGCAATGATGTCAATTTCCAGGACACAAGTGATGAATCCAGTGG 
CTCAGGGAGTGGCAGTGGGTGCATGGATGACGTGTGTCCCACGGAGTTTGAGTTTGTCACCA 
CAGAGGCCCCCGCAGTGGATCCCGACCGGAGAGAGGTGGACTCTTCTGCAGCCCAGCGTGGC 
CACTCCCTGCTCTCCTGGTCTCTCACCTGCATTGTCCTGGCACTGCAGAGACTGTGCAGATA 
ATCTTGGGTTTTTGGTCAGATGAAACTGCATTTTAGCTATCTGAATGGCCAACTCACTTCTT 
TTCTTACACTCTTGGACAATGGACCATGCCACAAAAACTTACCGTTTTCTATGAGAAGAGAG 
CAGTAATGCAATCTGCCTCCCTTTTTGTTTTCCCAAAGAGTACCGGGTGCCAGACTGAACTG 
CTTCCTCTTTCCTTCAGCTATCTGTGGGGACCTTGTTTATTCTAGAGAGAATTCTTACTCAA 
ATTTTTCGTACCAGGAGATTTTCTTACCTTCATTTGCTTTTATGCTGCAGAAGTAAAGGAAT 
CTCACGTTGTGAGGGTTTTTTTTTTCTCATTTAA?^T 
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></usr/seqdb2/sst/DNA/Dnaseqs .tnin/ss .DNA50914 
xsubunit 1 of 1, 555 aa, 1 stop 
xMW: 62736, pi: 5.36, NX{S/T) : 0 

MPSWIGAVILPLLGLLLSLPAGADVKARSCGEVRQAYGAKGFSLADIPYQEIAGEHLRICPQ 

EYTCCTTEMEDKLSQQSKLEFENLVEETSHFVRTTFVSRHKKFDEFFRELLENAEKSLNDMF 

VRTYGMLYMQNSEVFQDLFTELKRYYTGGNVNLEEMLNDFWARLLERMFQLINPQYHFSEDY 

LECVSKYTDQLKPFGDVPRKLKIQVTRAFIAARTFVQGLTVGREVANRVSKVSPTPGCIRAL 

MKMLYCPYCRGLPTWPCNNYCLNVMKGCLANQADLDTEWNLFIDAMLLVAERLEGPFNIES 

VMDP IDVKI SEAIMNMQENSMQVSAKVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEER 

PTTAAGTSLDRLVTDIKEKLKLSKKVWSALPYTICKDESVTAGTSNEEECWNGHSKARYLPE 

IMM)GLTNQimTPEVDVDITRPDTFIRQQIMALRVMTNKLKNAYNGNDVNF 

SGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRLCR 



Signal peptide: 

amino acids 1-23 
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CGGACGCGTGGGCGQACGCGTGGGCAAAAGAACTCGGAGTGCCaWlGCTAAATAAGTTAGCTGAGAAAACGCACG 

CAGTTTGCAGCGCCTGCGCCGGGTGCGCCAACTACGO^GACCfiAGCGGGCTCCGCGCGGACCGGC 

TAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGAATTGCTGCCCGGAGTTTCTGCGGAGGT 

GQAGGGAGATCAGGAAACGGCTTCTTCCTCACTTCGCCGCCTGGTGaGTGTCGGGGAGATTGGCaAACGCCTAGG 

AAAGQACTGGGGAAAATAGCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTG 

TCTGATCAGAGCCAGACGCGACGCGTCCa.CTTCGCAGTTCTTTCaVGGTGTGGGGACCGCAGGACAGACGGCCG^ 

TCCCGCCGCCCTCCGTACCAGCACTCCCAGGAGAGTCAGCCTCGCTCCCCAACGTCQAGGGCGCTCTGGCCACGA 

AAAGTTCCTGTCCACTGTGATTCTCAATTCCTTGCTTGGTTTTTTTCTCCAGAGAACTTTTGGGTGGAGATATTA 

ACTTTTTTCTTTTTTTTTTTCCTTGGTGGAAGCTGCTCTAGGQAGGGGGGAGGAGGAGGAGAAAGTGAAATGTGC 

TGGAQAAGAGCGAGCCCTCCTTGTTCTTCCGGAGTCCCATCCATTAAGCCATCACTTCTGGAAQATTAAAGTTGT 

CGGACATGGTGACAGCTGAGAGGAGAGGAGGATTTCTTGCCAGGTGGAGAGTCTTCACCGTCTGTTGGGTGC^^ 

TGTGCGCCCGCAGCGGCGCGGGGCGCGTGGTTCTCCGCGTGGAGTCTCACCTGGGACCTGAGTGAATGGCTCC^ 

GGGGCTGTGCGGGGCATCCGCCTCCGCCTTCTCCACAGGCCTGTGTCTGTCCTGQAAAGATGCTAGCAATGGGGG 

CGCTGGCAGGATTCTGGATCCTOTGCCTCCTCACTTATGGTTACCTGTCCTGGGGCCAGGCCTTAGAAGAGGAGG 

AAGAAGGGGCCTTACTAGCTCAAGCTGGAGAGAAACTAGAGCCCAGCACaACTTCCACCTCCCAGCCCCATCT 

TTTTCATCCTAGCGGATGATCAGGGATTTAGAGATGTGGGTrACCACGGATCTGAGATTAAAACACCTACTCTTG 

ACAAGCTCGCTGCCGAAGGAGTTAAACTGGAGAACTACTATGTCCAGCCTATTTGCACACCATCCAGGAGTCAGT 

TTATTACTGGAAAGTATCAGATACACACCGGACTTCAACATTCTATCATAAGACCTACCCAACCCa^ 

CTCTGGACAATGCCACCCTACCTCaGAAACTGAAGGAGGTTGGATATTCAACGCATATGGTCGGAAAATGGC^^ 

TGGGTTTTAACAGAAAAGAATGCATGCCCACCAGAAGAGGATTTGATACCTTTTTTGGTTCCCTTTTGGQAAGTG 

GGGATTACTATACACACTACAAATGTGACAGTCCTGGGATGTGTGGCTATGACTTGTATGAAAACGACSiATGCTG 

CCTGGGACTATGACAATGGCATATACTCCACACAGATGTACACTCAGAGAGTACAGaWiTC^ 

ACCCCAC»AAGCOTATATTTTTATATACTGCCTATCAAGCTGTTCATTCACCACTGCAAGCTCCTGGCAG 

TCGAACACTACCGATCCATTATCAACaTAAACAGGAGAAGATATGCTGCCATGCTTTCCTGCTTAGaTGAAGC^ 

TCAACAACGTGACATTGGCTCTAAAGACTTATGGTTTCTATAACAACAGCATTATCATTTACTCTTCAGA^ 

GTGGCCAGCCTACGGCAGQAGGGAGTAACTGGCCTCTCAGAGGTAGCAAAGGAACATATTGGGAAGGAGGGAT^^ 

GGGCTGTAGGCTTTGTGCATAGCCCACTTCTGAAAAACAAGGGAACAGTGTGTAAGQAACTT6TGCACATCACTG 

ACTGGTACCCCACTCTCATTTCACTGGCTGAAGGAaiGATTGATGAGGACATTCAACTAGATGGCTATGATATC 

GGGAGACCATAAGTGAGGGTCTTCGCTCACCCCQAGTAGATATTTTGCATAACATTQACCCCTATACACCAAGGC 

AAAAAATGGCTCCTGGGCAGCAGGCTATGGQATCTGGAACACTGCAATCCAGTCAGCCATCA^^ 

GaAATTGCTTACAGQAAATCCTGGCTACAGCGACTGGGTCCCCCCTCAGTCTTTCAGCAACCTGGGACCGAACra 

GTGGCaCAATGAACGGATCACCTTGTCAACTGGCAAAA6TGTATGGCTTTTCAACa.Tav^^ 

QAGGGTGGACCTATCTAA(^GGTATCCAGGAATCGTGAAGAAGCTCCTACGGAGGCTCTCACAGTTCAACAAAAC 

TGCAGTGCCGGTCAGGTATCCCCCCAAAGACCCCAGAAGTAACCCTAGGCTCAATGGAGGGGTCTGGGGACCATG 

GTATAAAGAGGAAACCaAGAAAAAGAAGCO^GCaAAJ^TCAGGCTGAGAAAAAGCAAAAGAAAAGC^^ 

GAAQAAGAAACAGCAGAAAGCAGTCTCAGGTAAACCAGCA^TTTGGCTCGATAATATCGCTGGCCTAAGCGTCA 

GGCTTGTTTTCaTGCTGTGCCACTCCAGAGACTTCTGCCACCTGGCCGCCACACTGAAAACTGTCCTGCTCAGTG 

CCAAGGTGCTACTCTTGCaU^GCCACACTTAQAGAGAGTGQAGATGTTTATTTCTCTCGCTCCTTTAGAAAACGTG 

GTGAGTCCTQAGTTCCACTGCTGTGCTTCAGTCAACTGACCAAACACTGCTTTGAATTATAGGAGGAGAACAATA 

ACCTACCATCCGCAAGCATGCTAATTTGATGGAAGTTACAGGGTAGCATGATTAAAACTACCTTTGATAAATTAC 
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AGTCAAAGATTGTGTO^CCTCAAAGGCCTTGAAGAATATATTTTCTTGGTGAATTTTTGTATGTCTGTCATATGA 

CACTTGGGTTTTTTAATTAATTCTATTTTATATATATAAATATATGTTTCTTTTCCTGTGAAAAGCTGTTTTTCT 

CaCB.TGTGAACAGCTTGCACCTCATTTTACCaTGCGTGAGGGAATGGCAAATAAGAATGTTTGAGC:AC^ 

ACAATGAATGTAACTATTTTCTAAACACTTTACTAGAAGAACATTTCAGTATAAAAAACCTAATTTATTTTTACA 

GAAAAATATTTTGTTGTTTTTATAAAAAGTTATGCAAATGACTTTTATTTTTATTTCCTGCATACCATTAGAAGA 

ATTTTATTTCATTTCTTCAAATTATCAAGCACTGTAATACTATAAATTAATGTAATACTGTGTGAATTCAGACTA 

TAAAAAACATCaTTCAGAAAACTTTATAATCGTCy^TTGTTCAATCAAGATTTTGAATGTAATAAGATGAATATAT 

ATTACTTGGAAATTCAATGTTTGTGCAGAGTTGAGACAACTTTATTGTTTCTATCATAAACTATTTATGTATCTT 

AATTATTAAAATGATTTACTTTATGGCACTAGAAAATTTACTGTGGCTTTTCTGATCTAACTTCTAGCTAAAATT 

GTATCATTGGTCCTAAAAAATAAAAATCTTTACTAATAGGCAATTGAAGGAATGGTTTGCTAACAACCAC^ 

TATAATATGATTTTACAGATAGATGCTTCCCCTTGGCTATGACATGGAQAAAGATTTTCCCATAATAATAACTAA 

TATTTATATTAGGTTGGTGCAAAACTAGTTGCGGTTTTTCCCATTAAAAGTAATAACCTTACTCTTATACAAAGT 

GGACACTGTGGGGAGATACAGAQAAATGGAAGATACGQATCCTGCCTGGAGTAGGTAACCTTGCTTGGAAACCCC 

ACATGCAAACGTCTkTGAGQAGAATTAAAGGAGTATTATCAGTAATaaAGTTTATCATGGGTCATCaATGAGCa 

GATTGGTGTGGATCCTGTAGACCCTGGTGTTTTCTTTGAAGTGCCCTCTCCTAATGCAGAGGCCTTGAAGCTTAC 

AGTATACACTTGAAAAGTCACAGATAGCTAGAATTATGATCTTTGAAGTTATAACTGTGATCTGAAAATGTGTGT 

GGTGGTATGACAGCATACCATTAAATACa.TTTACaTCACAGCTCAAAGGACTGTGATATAATC 

AACTCaAAGGACTGTGATATAATCCATTTATATCACa.GCTC:ACAGTTTCTGAAAATGTATAAAAGAATCTATAAT 

CTAGTACTGAAATTACTAAATTGGGTAAGATGATTTAAATGATTTTAATTTTAACATTTTATTTCTAGAATATAT 

GGCTCCATTTTATTTTATAGTGTAAAGTTGTATTTCCTAAAGTTTGTGTTTTGTCGACAGTATCTTTTAAATGAG 

TCTTAAAAATAAAGGCATATTGTTCATGTTTAAAAAAAAAAAAA2UWWiAAAAAAAAAAAAAAAAAAA2\AAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 43 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48296 
xsiibunit 1 of 1, 515 aa, 1 stop 
xMW: 56885, pi: 6.49, NX(S/T): 5 

MAPRGCAGHPPPPSPQACVCPGKMLAMGALAGFWILCLLTYGYLSWGQALEEEEEGALLAQA 
GEKLEPSTTSTSQPHLIFILADDQGFRDVGYHGSEIKTPTLDKLAAEGVKLENYYVQPICTP 
SRSQFITGKYQIHTGLQHS I IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKEC 
MPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 
ILASHNPTKPIFLYTAYQAVHSPLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 
LKTYGFYNNS 1 1 1 YSSDNGGQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCK 
ELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPYTPRQKMAPG 
QQAMGSGTLQSSQPSECSTGNCLQEILATATGSPLSLSATWDRTGGTMNGSPCQLAKVYGFS 
TSQPTHMRGWTYLTGIQES 

Important Features: 
Signal Peptide: 

amino acids 1-3 7 

Sulfatases signature 1. 

amino acids 120-132 

Sulfatases signature 2. 

amino acids 168-177 

Tyrosine kinase phosphorylation site. 

amino acids 163-169 

N-glycosylation sites. 

amino acids 157-160, 306-309 and 318-321 
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CGGACGCGTGGGTGCGAGTGGAGCGGAGGACCCGAGCGGCTGAGGAGAGAGGAGGCGGCGGC 
TTAGCTGCTACGGGGTCCGGCCGGCGCCCTCCCGAGGGGGGCTCAGGAGGAGGAAGGAGGAC 
CCGTGCGAGAATGCCTCTGCCCTGGAGCCTTGCGCTCCCGCTGCTGCTCTCCTGGGTGGCAG 
GTGGTTTCGGGAACGCGGCCAGTGCAAGGCATCACGGGTTGTTAGCATCGGCACGTCAGCCT 
GGGGTCTGTCACTATGGAACTAAACTGGCCTGCTGCTACGGCTGGAGAAGAAACAGCAAGGG 
AGTCTGTGAAGCTACATGCGAACCTGGATGTAAGTTTGGTGAGTGCGTGGGACCAAACAAAT 
GCAGATGCTTTCCAGGATACACCGGGAAAACCTGCAGTCAAGATGTGAATGAGTGTGGAATG 
AAACCCCGGCCATGCCAACACAGATGTGTGAATACACACGGAAGCTACAAGTGCTTTTGCCT 
CAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGAACTCTAGGACATGTGCCATGATAA 
ACTGTCAGTACAGCTGTGAAGACACAGAAGAAGGGCCACAGTGCCTGTGTCCATCCTCAGGA 
CTCCGCCTGGCCCCAAATGGAAGAGACTGTCTAGATATTGATGAATGTGCCTCTGGTAAAGT 
CATCTGTCCCTACAATCGAAGATGTGTGAACACATTTGGAAGCTACTACTGCAAATGTCACA 
TTGGTTTCGAACTGCAATATATCAGTGGACGATATGACTGTATAGATATAAATGAATGTACT 
ATGGATAGCCATACGTGCAGCCACCATGCCAATTGCTTCAATACCCAAGGGTCCTTCAAGTG 
TAAATGCAAGCAGGGATATAAAGGCAATGGACTTCGGTGTTCTGCTATCCCTGAAAATTCTG 
TGAAGGAAGTCCTCAGAGCACCTGGTACCATCAAAGACAGAATCAAGAAGTTGCTTGCTCAC 
AAAAACAGCATGAAAAAGAAGGCAAAAATTAAAAATGTTACCCCAGAACCCACCAGGACTCC 
TACCCCTAAGGTGAACTTGCAGCCCTTCAACTATGAAGAGATAGTTTCCAGAGGCGGGAACT 
CTCATGGAGGTAAAAAAGGGAATGAAGAGAAATGAAAGAGGGGCTTGAGGAT6AGAAAAGAG 
AAGAGAAAGCCCTGAAGAATGACATAGAGGAGCGAAGCCTGCGAGGAGATGTGTTTTTCCCT 
AAGGTGAATGAAGCAGGTGAATTCGGCCTGATTCTGGTCCAAAGGAAAGCGCTAACTTCCAA 
ACTGGAACATAAAGATTTAAATATCTCGGTTGACTGCAGCTTCAATCATGGGATCTGTGACT 
GGAAACAGGATAGAGAAGATGATTTTGACTGG2^TCCTGCTGATCGAGATAATGCTATTGGC 
TTCTATATGGCAGTTCCGGCCTTGGCAGGTCACAAGAAAGACATTGGCCGATTGAAACTTCT 
CCTACCTGACCTGCAACCCCAAAGCAACTTCTGTTTGCTCTTTGATTACCGGCTGGCCGGAG 
ACAAAGTCGGGAAACTTCGAGTGTTTGTGAAAAACAGTAACAATGCCCTGGCATGGGAGAAG 
ACCACGAGTGAGGATGAAAAGTGGAAGACAGGGAAAATTCAGTTGTATCAAGGAACTGATGC 
TACCAAAAGCATCATTTTTGAAGCAGAACGTGGCAAGGGCAAAACCGGCGAAATCGCAGTGG 
ATGGCGTCTTGCTTGTTTCAGGCTTATGTCCAGATAGCCTTTTATCTGTGGATGACTGAATG 
TTACTATCTTTATATTTGACTTTGTATGTCAGTTCCCTGGTTTTTTTGATATTGCATCATAG 
GACCTCTGGCATTTTAGAATTACTAGCTGAAAAATTGTAATGTACCAACAGAAATATTATTG 
TAAGATGCCTTTCTTGTATAAGATATGCCAATATTTGCTTTAAATATCATATCACTGTATCT 
TCTCAGTCATTTCTGAATCTTTCCNCATTATATTATAAAATNTGGAAANGTCAGTTTATCTC 
CCCTCCTCNGTATATCTGATTTGTATANGTANGTTGATGNGCTTCTCTCTACAACATTTCTA 
GAAAATAGAAAAAAAAGCACAGAGAAATGTTTAACTGTTTGACTCTTATGATACTTCTTGGA 
AACTATGACATCAAAGATAGACTTTTGCCTAAGTGGCTTAGCTGGGTCTTTCATAGCCAAAC 
TTGTATATTTAATTCTTTGTAATAATAA 
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MPLPWSIALPLLLSWVAGGFGNAASARHHGLLASARQPGVCHYGTKLACCYGWRRNSKGVCE 
ATCEPGCKFGECVGPNKCRCFPGYTGKTCSQDVNECGMKPRPCQHRCVNTHGSYKCFCLSGH 
MLMPDATCVNSRTCAMINCQYSCEDTEEGPQCLCPSSGLRLAPNGRDCLDIDECASGKVICP 
YNRRCVNTFGSYYCKCHIGFELQYISGRYDCIDINECTMDSHTCSHHANCFNTQGSFKCKCK 
QGYKGNGLRCSAIPENSVKEVLRAPGTIKDRIKiOLLAHKNSMKEaCAKIKNVTPEPTRTPTPK 
VNLQPFNYEEIVSRGGNSHGGKKGNEEK 

Signal peptide: 
amino acids 1-21 

EGF-like domain cysteine pattern signature. 

amino acids 80-91 

Calci-um-binding EGF-like domains 

amino acids 103-124, 230-251 and 185-206 
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GGGAGCTGCTGCTGTGGCTGCTGGTGCTGTGCGCGCTGCTCCTGCTCTTGGTGCAGCTGCTG 
CGCTTCCTGAGGGCTGACGGCGACCTGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCC 
AGAATGGGAGCTGACTGATATGGTGGTGTGGGTGACTGGAGCCTCGAGTGGAATTGGTGAGG 
AGCTGGCTTACCAGTTGTCTAAACTAGGAGTTTCTCTTGTGCTGTCAGCCAGAAGAGTGCAT 
GAGCTGGAAAGGGTGAAAAGAAGATGCCTAGAGAATGGCAATTTAAAAGAAAAAGATATACT 
TGTTTTGCCCCTTGACCTGACCGACACTGGTTCCCATGAAGCGGCTACCAAAGCT6TTCTCC 
AGGAGTTTGGTAGAATCGACATTCTGGTCAACAATGGTGGAATGTCCCAGCGTTCTCTGTGC 
ATGGATACCAGCTTGGATGTCTACAGAAAGCTAATAGAGCTTAACTACTTAGGGACGGTGTC 
CTTGACAAAATGTGTTCTGCCTCACATGATCGAGAGGAAGCAAGGAAAGATTGTTACTGTGA 
ATAGCATCCTGGGTATCATATCTGTACCTCTTTCCATTGGATACTGTGCTAGCAAGCATGCT 
CTCCGGGGTTTTTTTAATGGCCTTCGAACAGAACTTGCCACATACCCAGGTATAATAGTTTC 
TAACATTTGCCCAGGACCTGTGCAATCAAATATTGTGGAGAATTCCCTAGCTGGAGAAGTCA 
CAAAGACTATAGGCAATAATGGAGACCAGTCCCACAAGATGACAACCAGTCGTTGTGTGCGG 
CTGATGTTAATCAGCATGGCCAATGATTTGAAAGAAGTTTGGATCTCAGAACAACCTTTCTT 
GTTAGTAACATATTTGTGGCAATACATGCCAACCTGGGCCTGGTGGATAACCAACAAGATGG 
GGAAGAAAAGGATTGAGAACTTTAAGAGTGGTGTGGATGCAGACTCTTCTTATTTTAAAATC 
TTTAAGACAAAACATGACTGAAAAGAGCACCTGTACTTTTCAAGCCACTGGAGGGAGAAATG 
GAAAACATGAAAACAGCAATCTTCTTATGCTTCTGAATAATCAAAGACTAATTTGTGATTTT 
ACTTTTTAATAGATATGACTTTGCTTCCAACATGGAATGAAATAAAAAATAAATAATAAAAG 
ATTGCCATGAATCTTGCAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA36343 
xsubunit 1 of 1, 289 aa, 1 stop 
xMW: 32268, pi: 9.21, NX(S/T): 0 

MWWVTGASSGIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDILVLPLDL 
TDTGSHEAATKAVLQEFGRIDILVNNGGMSQRSLCMDTSLDVYRKLIELNYLGTVSLTKCVL 
PHMIERKQGKIVTVNSILGIISVPLSIGYCASKHALRGFFNGLRTELATYPGIIVSNICPGP 
VQSNI VENSLAGE WKT IGNNGDQSHKMTTSRCVRLMLI SMANDLKEVWI S EQPFLLVTYLW 
QYMPTWAWWITNKMGKKRIENFKSGVDADSSYFKIFKTKHD 

Important Features: 
Signal Peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 136-157 

Tyrosine kinase phosphorylation site. 

106-113 and 107-114 

Homologous region to Short-chain alcohol dehydrogenase 

amino acids 80-90, 131-168, 1-13 and 176-185 
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GCGACGTGGGCACCGCCATCAGCTGTTCGCGCGTCTTCTCCTCCAGGTGGGGCAGGGGTTTC 
GGGCTGGTGGAGCATGTGCTGGGACAGGACAGCATCCTCAATCAATCCAACAGCATATTCGG 
TTGCATCTTCTACACACTACAGCTATTGTTAGGTTGCCTGCGGACACGCTGGGCCTCTGTCC 
TGATGCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTACCTGGCCTGGATCCTGTTC 
TTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTATGCTATCAACGTGAGCCTGAT 
GTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCAAGGCTAAGAGGCACTGAGCCCTCA 
ACCCAAGCCAGGCTGACCTCATCTGCTTTGCTTTGGTCTTCAAGCCGCTCAGCGTGCCTGTG 
GACAGCGTGGCCCCGGCCCCCCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCC 
TGGCAGGCCAGTGTGAGGAGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACAC 
CTGGGTCCTCACTGCTGCCCACTGCTTTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGGT 
CAGTGGTCCTGGGTTCTCTGCAGCGTGAGGGACTCAGCCCTGGGGCCGAAGAGGTGGGGGTG 
GCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTACAGCCAGGGCTCAGACCTGGCCCTGCT 
GCAGCTCGCCCACCCCACGACCCACACACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCC 
CCTTTGGAGCCTCCTGCTGGGCCACTGGCTGGGATCAGGACACCAGTGATGCTCCTGGGACC 
CTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACTGTATCTACAACCAGCT 
GCACCAGCGACACCTGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTG 
GGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGA 
CACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGT 
GCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTT 
TCCTGGCCCAGAGCCCAGAGACCCCGGAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGA 
TCCTTGAGGACAGCAGGTCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCT 
GATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTG 
CTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGA 
CCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGG 
CTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCC 
TCTGCCTGCCCTATCCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGG 
GCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAG 
GGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGG 
TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTG 
CATGAGGTGAGGGGCACATGGTTCCTGGCCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGG 
CCCCGCCAGGCCGGCGGTCTTCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAGTTTGG 
ACTGGCAGGTCTACTTCGCCGAGGAACCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCC 
AACATAAGCCAACCAACCAGCTGCTGACAGGGGACCTGGCCATTCTCAGGACAAGAGAATGC 
AGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAGGCAC 
CAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCA 
CTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAGC 
AGGCGTCTCAGCTTTCCTCCTCCTTTACTCTTTCAGATACAATCACGCCAGCCACGTTGTTT 

AAAATAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA40571 

MLLSSLVSLAGSVYLAWILFFVLYDFCIVCITTYAINVSLMWLSFRKVQEPQGKAKRHGNTV 
PGEWPWQASVRRQGAHICSGSLVADTWVLTAAHCFEKAAATELNSWSWLGSLQREGLSPGA 
EEVGVAALQLPRAYNHYSQGSDLALLQLi^HPTTHTPLCLPQPAHRFPFGASCWATGWDQDTS 
DAPGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVLC 
LEPDGHWVQAG I I S FAS SCAQEDAPVLLTNTAAHS S WLQARVQGAAFLAQS PETPEMSDEDS 
CVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRQAPEEWSV 
GLGTRPEEWGLKQL I LHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYPDHHLPDGERG 
WVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTSAVGELPSCEGLS 
GAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWVSSLDWQVYFAEEPEPEAEP 
GSCLANISQPTSC 

Important features : 
Signal peptide: 

amino acids 1-15 

Homologous region to Serine proteases, trypsin family 

amino acids 79-95, 343-359 and 237-247 

N-glycosylation sites. 

amino acids 37-40 and 564-567 

Kringle domains 

amino acids 79-96, 343-360 and 235-247 
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CGGGCCGCCCCCGGCCCCCATTCGGGCCGGGCCTCGCTGCGGCGGCGACTGAGCCAGGCTGG 
GCCGCGTCCCTGAGTCCCAGAGTCGGCGCGGCGCGGCAGGGGCAGCCTTCCACCACGGGGAG 
CCCAGCTGTCAGCCGCCTCACAGGAAGATGCTGCGTCGGCGGGGCAGCCCTGGCATGGGTGT 
GCATGTGGGTGCAGCCCTGGGAGCACTGTGGTTCTGCCTCACAGGAGCCCTGGAGGTCCAGG 
TCCCTGAAGACCCAGTGGTGGCACTGGTGGGCACCGATGCCACCCTGTGCTGCTCCTTCTCC 
CCTGAGCCTGGCTTCAGCCTGGCACAGCTCAACCTCATCTGGCAGCTGACAGATACCAAACA 
GCTGGTGCACAGCTTTGCTGAGGGCCAGGACCAGGGCAGCGCCTAT6CCAACCGCACGGCCC 
TCTTCCCGGACCTGCTGGCACAGGGCAACGCATCCCTGAGGCTGCAGCGCGTGCGTGTGGCG 
GACGAGGGCAGCTTCACCTGCTTCGTGAGCATCCGGGATTTCGGCAGCGCTGCCGTCAGCCT 
GCAGGTGGCCGCTCCCTACTCGAAGCCCAGCATGACCCTGGAGCCCAACAAGGACCTGCGGC 
CAGGGGACACGGTGACCATCACGTGCTCCAGCTACCAGGGCTACCCTGAGGCTGAGGTGTTC 
TGGCAGGATGGGCAGGGTGTGCCCCTGACTGGCAACGTGACCACGTCGCAGATGGCCAACGA 
GCAGGGCTTGTTTGATGTGCACAGCGTCCTGCGGGTGGTGCTGGGTGCGAATGGCACCTACA 
GCTGCCTGGTGCGCAACCCCGTGCTGCAGCAGGATGCGCACRGCTCTGTCACCATCACAGGG 
CAGCCTATGACATTCCCCCCAGAGGCCCTGTGGGTGACCGTGGGGCTGTCTGTCTGTCTCAT 
TGCACTGCTGGTGGCCCTGGCTTTCGTGTGCTGGAGAAAGATCAAACAGAGCTGTGAGGAGG 
AGAATGCAGGAGCTGAGGACCAGGATGGGGAGGGAGAAGGCTCCAAGACAGCCCTGCAGCCT 
CTGAAACACTCTGACAGCAAAGAAGATGATGGACAAGAAATAGC CTGA CCATGAGGACCAGG 
GAGCTGCTACCCCTCCCTACAGCTCCTACCCTCTGGCTGCAATGGGGCTGCACTGTGAGCCC 
TGCCCCCAACAGATGCATCCTGCTCTGACAGGTGGGCTCCTTCTCCAAAGGATGCGATACAC 
AGACCACTGTGCAGCCTTATTTCTCCAATGGACATGATTCCCAAGTCATCCTGCTGCCTTTT 
TTCTTATAGACACAATGAACAGACCACCCACAACCTTAGTTCTCTAAGTCATCCTGCCTGCT 
GCCTTATTTCACAGTACATACATTTCTTAGGGACACAGTACACTGACCACATCACCACCCTC 
TTCTTCCAGTGCTGCGTGGACCATCTGGCTGCCTTTTTTCTCCAAAAGATGCAATATTCAGA 
CTGACTGACCCCCTGCCTTATTTCACCAAAGACACGATGCATAGTCACCCCGGCCTTGTTTC 
TCCAATGGCCGTGATACACTAGTGATCATGTTCAGCCCTGCTTCCACCTGCATAGAATCTTT 
TCTTCTCAGACAGGGACAGTGCGGCCTCAACATCTCCTGGAGTCTAGAAGCTGTTTCCTTTC 
CCCTCCTTCCTCCCTGCCCCAAGTGAAGACAGGGCAGGGCCAGGAATGCTTTGGGGACACCG 
AGGGGACTGCCCCCCACCCCCACCATGGTGCTATTCTGGGGCTGGGGCAGTCTTTTCCTGGC 
TTGCCTCTGGCCAGCTCCTGGCCTCTGGTAGAGTGAGACTTCAGACGTTCTGATGCCTTCCG 
GATGTCATCTCTCCCTGCCCCAGGAATGGAAGATGTGAGGACTTCTAATTTAAATGTGGGAC 
TCGGAGGGATTTTGTAAACTGGGGGTATATTTTGGGGAAAATAAATGTCTTTGTAAAAAAAA 
AAAAAAAAAAAAAA 



FIGURE 51 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss-DNA41386 
xsiobunit 1 of 1, 316 aa, 1 stop, 1 unknown 
xMW: -1, pi: 4.62, NX(S/T): 4 

MLRRRGSPGMGVHVGAALGALWFCLTGALEVQVPEDPWALVGTDATLCCSFSPEPGFSLAQ 
LNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALFPDLLAQGNASLRLQRVRVADEGSFTCFV 
SIRDFGSAAVSLQVAAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQGVPL 
TGlSr^TSQMANEQGLFDVHSVLRVVLGANGTYSCLVRNPVLQQDAHXSVTITGQPMTFPPEA 
LWVTVGLSVCLIALLVALAFVCWRKIKQSCEEENAGAEDQDGEGEGSKTALQPLKHSDSKED 
DGQEIA 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 251-270 

N-glycosylation site. 

amino acids 91-94, 104-107, 189-192 and 215-218 



Homologous region to ImmTonoglobulins and MHC 

amino acids 217-234 



FIGURE 52 



TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTC 
CTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCG 
AGGGACCTTTCGCTGCTTTTGTAGGGACTTCTTTCCTTGCTTCAGCAAC ATGA GGCTTTTCT 
TGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGATTGGGGCTTTGATCCCTGAACCA 
GAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCAAGACCAAAGGAGGGGA 
TTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTC 
ACAAACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGT 
TGGGACCAGGGCTTGAAAGGAATGTGTGTAGGAGAGAAGAGAAAGCTCATCATTCCTCCTGC 
TCTGGGCTATGGAAAAGAAGGAAAAGGTAAAATTCCCCCAGAAAGTACACTGATATTTAATA 
TTGATCTCCTGGAGATTCGAAATGGACCAAGATCCCATGAATCATTCCAAGAAATGGATCTT 
AATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGTTTGAAAA 
ACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAG 
AAGATGAAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTA 
TAGAGATACATCTACCCTTTTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAA 
AGAACATTTTATTTTTATACAATGTTCTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTT 
CTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATT 
GGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAG 
ATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACC 
ACGACATGAGACCAGGTTATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTC 
TCCAAGTTAGAGGTCAACATTTGAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCAT 
GTTATAATGAAATAGTTTATGTGTAACTGGCTCTGAGTCTCTGCTTGAGGACCAGAGGAAAA 
TGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAG 
TTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG 
GCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAA 
CCCTATCTCTACTAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCT 
ACCCAGGAAGGCTGAGGCGGCAGAATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAG 
ATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAAAAGAACACGGTTAATACCATATNA 
ATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGAT 
TGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATG 
TATCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGC 
TAGCGGAATATCCTTCCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACA 
TTGTATCATAAGATAAAGTAGTAAACCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAAC 
TGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGG 
TGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTA 
CTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAG 
GCTGAGACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCC 
ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAJyy^AAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 



FIGURE 53 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44194 
xsubimit 1 of 1, 211 aa, 1 stop 
xMW: 24172, pi: 5.99, NX(S/T) : 1 

MRLFLWNAVLTLFVTS L IGALI PEPEVKI EVLQKPF I CHRKTKGGDLMLVHYEGYLEKDGSL 
FHSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPAL6YGKEGKGKIPPEST 
LIFNIDLLEIRNGPRSHESFQEMDLNDDWKLSKDEVKAYLKKEFEKHGAWNESHHDALVED 
I FDKEDEDKDGFI S ARE FT YKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulxam targeting sequence. 

amino acids 208-211 

FKBP-type peptidyl -prolyl cis-trans isomerase 

amino acids 78-114 and 118-131 

EF-hand calcium-binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-lOO/ICaBP type calcium binding domain 

amino acids 183-203 



FIGURE 54 

AATAAAGCTTCCTTAATGTTGTATATGTCTTTGAAGTACATCCGTGCATTTTTTTTTAGCAT 
CCAACCATTCCTCCCTTGTAGTTCTCGCCCCCTCAAATCACCCTCTCCCGTAGCCCACCCGA 
CTAACATCTCAGTCTCTGAAAATGCACAGAGATGCCTG6CTACCTCGCCCTGCCTTCAGCCT 
CACGGGGCTCAGTCTCTTTTTCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCACAGTAC 
CTGCCACCCTCAACGTCCTCAATGGCTCTGACGCCCGCCTGCCCTGCACCTTCAACTCCTGC 
TACACAGTGAACCACAAACAGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTC 
TGAGGAGATGTTCCTCCAGTTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAG 
ACCGCGTGGAGTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTG 
CAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCCTGACCGCCACCGTGG 
CCATGGCAAGATCCATCTGCAGGTCCTCATGGAAGAGCCCCCTGAGCGGGACTCCACGGTGG 
CCGTGATTGTGGGTGCCTCCGTCGGGGGCTTCCTGGCTGTGGTCATCTTGGTGCTGATGGTG 
GTCAAGTGTGTGAGGAGAAAAAAAGAGCAGAAGCTGAGCACAGATGACCTGAAGACCGAGGA 
GGAGGGCAAGACGGACGGTGAAGGCAACCCGGATGATG6CGCCAAGTAGTGGGTGGCCGGCC 
CTGCAGCCTCCCGTGTCCCGTCTCCTCCCCTCTCCGCCCTGTACAGTGACCCTGCCTGCTCG 
CTCTTGGTGTGCTTCCCGTGACCTAGGACCCCAGGGCCCACCTGGGGCCTCCTGAACCCCCG 
ACTTCGTATCTCCCACCCTGCACCAAGAGTGACCCACTCTCTTCCATCCGAGAAACCTGCCA 
TGCTCTGGGACGTGTGGGCCCTGGGGAGAGGAGAGAAAGGGCTCCCACCTGCCAGTCCCTGG 
GGGGAGGCAGGAGGCACATGTGAGGGTCCCCAGAGAGJLAGGGAGTGGGTGGGCAGGGGTAGA 
GGAGGGGCCGCTGTCACCTGCCCAGTGCTTGCCTGGCAGTGGCTTCAGAGAGGACCTGGTGG 
GGAGGGAGGGCTTTCCTGTGCTGACAGCGCTCCCTCAGGAGGGCCTTGGCCTGGCACGGCTG 
TGCTCCTCCCCTGCTCCCAGCCCAGAGCAGCCATCAGGCTGGAGGTGACGATGAGTTCCTGA 
AACTTGGAGGGGCATGTTAAAGGGATGACTGTGCATTCCAGGGCACTGACGGAAAGCCAGGG 
CTGCAGGCAAAGCTGGACATGTGCCCTGGCCCAGGAGGCCATGTTGGGCCCTCGTTTCCATT 
GCTAGTGGCCTCCTTGGGGCTCCTGTTGGCTCCTAATCCCTTAGGACTGTGGATGAGGCCAG 
ACTGGAAGAGCAGCTCCAGGTAGGGGGCCATGTTTCCCAGCGGGGACCCACCAACAGAGGCC 
AGTTTCAAAGTCAGCTGAGGGGCTGAGGGGTGGGGCTCCATGGTGAATGCAGGTTGCTGCAG 
GCTCTGCCTTCTCCATGGGGTAACCACCCTCGCCTGGGCAGGGGCAGCCAAGGCTGGGAAAT 
GAGGAGGCCATGCACAGGGTGGGGCAGCTTTCTTTGGGGCTTCAGTGAGAACTCTCCCAGTT 
GCCCTTGGTGGGGTTTCCACCTGGCTTTTGGCTACAGAGAGGGAAGGGAAAGCCTGAGGCCG 
GCATAAGGGGAGGCCTTGGAACCTGAGCTGCCAATGCCAGCCCTGTCCCATCTGCGGCCACG 
CTACTCGCTCCTCTCCCAACAACTCCCTTCGTGGGGACAAAAGTGACAATTGTAGGCCAGGC 
ACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATTACCTCCAT 
CTGTTTAGTAGAAATGGGCJy^CCCCATCTCTACTAAAAATACAAGAATTAGCTGGGCGTG 
GTGGCGTGTGCCTGTAATCCCAGCTATTTGGGAGGCTGAGGCAGGAGAATCGCTTGAGCCCG 
GGAAGCAGAGGTTGCAGTGAACTGAGATAGTGATAGTGCCACTGCAATTCAGCCTGGGTGAC 
ATAGAGAGACTCCATCTCAAAAAAAA 



FIGURE 55 

</usr/seqdb2/ sst/DNA/Dnaseqs . min/ss .DNA45415 
<subunit 1 of 1, 215 aa, 1 stop 
<MW: 24326, pi: 6.32, NX(S/T) : 4 

MHRDAWLPRPAFSLTGLSLFFSLVPPGRSMEVTVPATLNVIJMGSDARLPCTFNSCYTVNHK^ 
FSLlMTYQECimCSEEMFLQFEMKIINLKLERPQDRVEFSGNPSKYDVSVMLRNVQPEDEGI 
YNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAVIVGASVGGFLAWILVLMWKCVRRK 
KEQKLSTDDLKTEEEGKTDGEGNPDDGAK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 
amino acids 161-179 

Immiinoglobulin-like fold: 

amino acids 83-127 

N-glycosylation sites. 

amino acids 42-45, 66-69 and 74-77 



FIGURE 56 



GTTCTCGCCCCCTCAAATCACCTTCTCCCTTAGCCCACCCNACTAACATCTCAGTCTCTGAA 
AATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCTCACGGGGCTCAGTCTCTTTT 
TCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCCACAGTACCTGNCCACCCTCAACGTCC 
TCAATGGCTCTGACGCCCGCCTGCCCTGCCCTTCAACTCCTGCTACACAGTGAACCACAAAC 
AGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTCTGAGGAGATGTTCCTCCAG 
TTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAGACCGCGTGGAGTTCTCAGG 
GAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTGCAGCCGGAGGATGAGGGGA 
TTTACAACTGCTACATCATGAACCCCCC 



FIGURE 57 



TCACGGGGCTCATCTCTTTTTCTCTTTGGTGCCCACCAGGACGGAGCATGGAGGTNCACATA 
CCTGCCACCCTCAACGTCCTCAATGGCTTTGACGCCCGCCTGCCCTGCACCTTCAACTCCNG 
CTACACAGTGAACCACAAACAGTTCTCCCTGAACTGGATTTACCAGGAGTGCAACAACTGGC 
TCTGAGGAGATGTTCCTCCAGTTCCCGCATGGAAGATCATTTAACCTGAAAGCTGGAAGCGG 
TTTTCAAGAACCGCGTGGAAGTTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGC 
TGAGAAACGTGCAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCC 



FIGURE 58 



TGCGGCGACCGTCGTACACCATGGGCCTCCACCTCCGCCCCTACCGTGTGGGGCTGCTCCCGGATGGCCTCCTGT 

TCCTCTTGCTGCTGCTAATGCTGCTCGCGGACCCAGCGCTCCCGGCCGGACGTCACCCCCCAGTGGTGCTGGTCC 

CTGGTGATTTGGGTAACCAACTGGAAGCCAAGCTGGACAAGCOSACAGTGGTGCAOTACCTCTGCTCC^ 

CCGAAAGCTACTTCAC»ATCTGGCTGAACCTGGAACTGCTGCTGCCTGTCATCATTGACTGCTGGATTGACAATA 

TCAGGCTGGTTTAC::AAG?UW5LCaTCCAGGGCCACCCAGTTTCCTGATGGTGTGGATGTACG^^ 

GGAAGACCTTCTCACTGGAGTTCCTGGACCCCAGCAaAAGCAGCGTGGGTTCCTATTTCCACACCATGGTGGAGA 

GCCTTGTGGGCTGGGGCTAGACACGGGGTGAGGATGTCCGAGGGGCTCCCTATGACTGGCGCCGAGCCCCAAATG 

AAAACGGGCCCTACTTCCTGGCCCTCCGCGAGATGATCGAGGAGATGTACCAGCTGTATGGGGGCCCCGTGGTGC 

TGGTTGCCCACAGTATGGGCAACATGTACACGCTCTACTTTCTGa^GCGGCAGCCGCAGGCCTGGAAGGAC^ 

ATATCCGGGCCTTCGTGTCACTGGGTGCGCCCTGGGGGGGCGTGGCCAAGACCCTGCGCGTCCTGGCTTCAGGAG 

ACAACa^CCGGATCCCAGTCATCGGGCCCCTGA^GATCCGGGAGCAGCAGCGGTCAGCTGTCTCCACCAGCTGGC 

TGCTGCCCTACT^OTACACATGGTCACCTGAGAAGGTGTTCGTGCAGACACCCAC^ 

ACTACCGCy^GTTCTTCCAGGACATCGGCTTTGAAGATGGCTGGCTCATGCGGCAGGACaCAGAAGGGCTGGTGG 

AAGCO^CGATGCCACCTGGCGTGOlGCTGCACTGCCTCTATGGTACTGGCGTCCCa^CACCAGACTCCTTCTACT 

ATGAGAGCTTCCCTGACCGTGACCCTAAAATCTGCTTTGGTGACGGCGATGGTACTGTGAACTTGAAGAGTGCCC 

TGCAGTGCCa.GGCCTGGC:S^GAGCCGCCAGGAGCACa^GTGTTGCTGCaGGAGCTGCCAGGCAGCGAGC^ 

AGATGCTGGCCAACGCCACCACCCTGGCCTATCTGAAACGTGTGCTCCTTGGGCCCTGACTCCTGTGCCACAGQA 

CTCCTGTGGCTCGGCCGTGGACCTGCTGTTGGCCTCTGGGGCTGTCATGGCCCACGCGTTTTGCAAAGTTTGTGA 

GTGGCAGTGAAGAAGGAAGAAATGAGAGTCTAGACTCAAGGGACy«:TGGATGGCAAQAATGCTGCTGATGGTGGA 
ACTGCTGTGACCTTAGGACTGGCTCCACAGGGTGGACTGGCTGGGCCCTGGTCCCAGTCCCTGCCTGGGGCCATG 
TGTCCCCCTATTCCTGTGGGCTTTTGATACTTGCCTACTGGGCCCTGGCCCCGCAGCCTTCCTATGAGGGATGTT 
ACTGGGCTGTGGTCCTGTACCCAGAGGTCCCAGGGATCGGCTCCTGGCCCCTCGGGTGACCCTTCCCACACACCA 
GCCACAGATAGGCCTGCCACTGGTCATGGGTAGCTAGAGCTGCTGGCTTCCCTGTGGCTTAGCTGGTGGCCAGCC 
TGACTGGCTTCCTGGGCGAGCCTAGTAGCTCCTGCAGGCAGGGGCAGTTTGTTGCGTTCTTCGTGGTTCCCAGGC 
CCTGGGACATCTCACTCCa^CTCCTACCTCCCTTACCACCAGGAGCATTCS^GCTCTGGATTQGGCAGCAGATGT^ 
CCCCCAGTCCCGCAGGCTGTGTTCCAGGGGCCCTGATTTCCTCGGATGTGCTATTGGCCCCAGGACTGAAGCTGC 
CTCCCTTCACCCTGGGACTGTGGTTCCAAGGATGAGAGCAGGGGTTGGAGCCATGGCCTTCTGGGAACCTATGGA 
GAAAGGGAATCCAAGGAAGCAGCO^GGCTGCTCGCAGCTTCCCTGAGCTGCACCTCTTGCTAACCCCACCyVTC^ 
CACTGCCACCCTGCCCTAGGGTCTCaCTAGTACCAAGTGGGTCAGCACAGGGCTGAGGATGGGGCTCCTATCCAC 
CCTGGCCAGCACCCAGCTTAGTGCTGGGACTAGCCCAGAAACTTGAATGGGACCCTGAGAGAGCCAGGGGTCCCC 
TGAGGCCCCCCTAGGGGCTTTCTGTCTGCCCCAGGGTGCTCCATGGATCTCCCTGTGGCAGCAGGCATGGAGAGT 
CAGGGCTGCCTTCATGGCAGTAGGCTCTAAGTGGGTGACTGGCCACAGGCCGAGAAAAGGGTACAGCCTCTAGGT 
GGGGTTCCCAAAGACGCCTTCAGGCTGGACTGAGCTGCTCTCCCACAGGGTTTCTGTGCAGCTGGATTTTCTCTG 
TTGCATACATGCCTGGCATCTGTCTCCCCTTGTTCCTGAGTGGCCCCACATGGGGCTCTQAGCAGGCTGTATCTG 
GATTCTGGCAATAAAAGTACTCTGGATGCTGTAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 59 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44189 
xsubunit 1 of 1, 412 aa, 1 stop 
xMW: 46658, pi: 6,65, NX(S/T): 4 

MGLHLRPYRVGLLPDGLLFLLLLLMLLADPALPAGRHPPVVLVPGDLGNQLEAKLDKPTVVH 
YLCS KKTESY FT I WLNLELLLPVI IDCWIDNIRLVYNKTSRATQFPDGVDVRVPGFGKTFSL 
EFLDPSKSSVGSYFHTMVESLVGWGYTRGEDVRGAPYDWRRAPNENGPYFLALREMIEEMYQ 
LYGGPWLVAHSMGNMYTLYFLQRQPQAWKDKYIRAFVSLGAPWGGVAKTLRVLASGDNNRI 
PVIGPLKIREQQRSAVSTSWLLPYNYTWSPEKVFVQTPTINYTLRDYRKFFQDIGFEDGWLM 
RQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFGDGDGTVNLKSALQCQ 
AWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRVLLGP 

Importcuit features: 
Signal peptide: 

amino acids 1-28 

Potential lipid substrate binding site: 
amino acids 147-164 

N-glycosylation sites. 

amino acids 99-102, 273-276, 289-292 and 398-401 

Lipases, serine proteins 

amino acids 189-201 

Beta-transducin family Trp-Asp repeat 

amino acids 353-365 



FIGURE 60 



CGGACGCGTGGGCGGACGCGTGGGGCGGCGGCAGCGGCGGCGACGGCGACATGGAGAGCGGG 
GCCTACGGCGCGGCCAAGGCGGGCGGCTCCTTCGACCTGCGGCGCTTCCTGACGCAGCCGCA 
GGTGGTGGCGCGCGCCGTGTGCTTGGTCTTCGCCTTGATCGTGTTCTCCTGCATCTATGGTG 
AGGGCTACAGCAATGCCCACGAGTCTAAGCAGATGTACTGCGTGTTCAACCGCAACGAGGAT 
GCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCCTGGCCTCGGCCTTCTTCTTGGT 
GGTCGACGCGTATTTCCCCCAGATCAGCAACGCCACTGACCGCAAGTACCTGGTCATTGGTG 
ACCTGCTCTTCTCAGCTCTCTGGACCTTCCTGTGGTTTGTTGGTTTCTGCTTCCTCACCAAC 
CAGTGGGCAGTCACCAACCCGAAGGACGTGCTGGTGGGGGCCGACTCTGTGAGGGCAGCCAT 
CACCTTCAGCTTCTTTTCCATCTTCTCCTGGGGTGTGCTGGCCTCCCTGGCCTACCAGCGCT 
ACAAGGCTGGCGTGGACGACTTCATCCAGAATTACGTTGACCCCACTCCGGACCCCAACACT 
GCCTACGCCTCCTACCCAGGTGCATCTGTGGACAACTACCAACAGCCACCCTTCACCCAGAA 
CGCGGAGACCACCGAGGGCTACCAGCCGCCCCCTGTGTAC TGAG TGGCGGTTAGCGTGGGAA 
GGGGGACAGAGAGGGCCCTCCCCTCTGCCCTGGACTTTCCCATCAGCCTCCTGGAACTGCCA 
GCCCCTCTCTTTCACCTGTTCCATCCTGTGCAGCTGACACACAGCTAAGGAGCCTCATAGCC 
TGGCGGGGGCTGGCAGAGCCACACCCCAAGTGCCTGTGCCCAGAGGGCTTCAGTCAGCCGCT 
CACTCCTCCAGGGCACTTTTAGGAAAGGGTTTTTAGCTAGTGTTTTTCCTCGCTTTTAATGA 
CCTCAGCCCCGCCTGCAGTGGCTAGAAGCCAGCAGGTGCCCATGTGCTACTGACAAGTGCCT 
CAGCTTCCCCCCGGCCCGGGTCAGGCCGTGGGAGCCGCTATTATCTGCGTTCTCTGCCAAAG 
ACTCGTGGGGGCCATCACACCTGCCCTGTGCAGCGGAGCCGGACCAGGCTCTTGTGTCCTCA 
CTCAGGTTTGCTTCCCCTGTGCCCACTGCTGTATGATCTGGGGGCCACCACCCTGTGCCGGT 
GGCCTCTGGGCTGCCTCCCGTGGTGTGAGGGCGGGGCTGGTGCTCATGGCACTTCCTCCTTG 
CTCCCACCCCTGGCAGCAGGGAAGGGCTTTGCCTGACAACACCCAGCTTTATGTAAATATTC 
TGCAGTTGTTACTTAGGAAGCCTGGGGAGGGCAGGGGTGCCCCATGGCTCCCAGACTCTGTC 
TGTGCCGAGTGTATTATAAAATCGTGGGGGAGATGCCCGGCCTGGGATGCTGTTTGGAGACG 
GAATAAATGTTTTCTCATTCAAAG 



FIGURE 61 

< /usr/ seqdb2 /ss t/DNA/Dnaseqs . min/ ss . DNA4 83 04 
<subunit 1 of 1, 224 aa, 1 stop 
<MW: 24810, pi: 4.75, NX{S/T): 1 

MESGAYGAAKAGGSFDLRRFLTQPQWARAVCLVFALIVFSCIYGEGYSNAHESKQMYCVFN 
RNEDACRYGSAIGVLAFLASAFFLWDAYFPQISNATDRKYLVIGDLLFSALWTFLWFVGFC 
FLTNQWAVTNPKDVLVGADS VRAAI TFSFFS I FSWGVLASLAYQRYKAGVDDFIQNYVDPTP 
DPNTAYASYPGASVDNYQQPPFTQNTIETTEGYQPPPVY 

Important features: 

Type II Transmembrane domain: 

amino acids 1-45 

Other transmembrane domains: 

amino acids 74-90, 108-126 and 145-161 

N-glycosylation site. 

amino acids 97-100 



FIGURE 62 

GftGCC^CCTACCCTGCTCCG&GGCCTlGGCCTGCAGGGCCTCATCGGCCAGAGGGTGATCAGTGAGC^ 

CCCGTGGCCGAGGCCCCCCAGGTGGCTGGCGGGCAGGGGGACGGAGGTGATGGCGAGGAAGCGGaGCCAQAGGGG 

ATGTTCAAGGCCTGTGAGGACTCCAAGAGAAAAGCCCGGGGCTACCTCCGCCTGGTGCCCCTGTTTGTGCTGCTG 

GCCCTGCTCGTGCTGGCTTCGGCGGGGGTGCTACTCTGGTATTTCCTAGGGTACAAGGCGGAGGTGATGGTCAGC 

CAGGTGTACTCAGGCAGTCTGCGTGTACTCAATCGCCACTTCTCCCAGGATCTTACCCGCCGGGAATCTAGTGCC 

TTCCGCaGTGAAACCGCCAAAGCCCa.GAAGATGCTCaAGGAGCTCATCACC:aGCaCCCGCCTGGGAACTTAC^^ 

AACTCCAGCTCCGTCTATTCCTTTGGGGAGGQACCCCTCACCTGCTTCTTCTGGTTCATTCTCCAAATCCCCGAG 

CACCGCCGGCTGATGCTGAGCCCCGAGGTGGTGCAGGCACTGCTGGTGGAGGAGCT6CTGTCCACAGTCAACAGC 

TCGGCTGCCGTCCCCTACAGGGCCGAGTACGAAGTGGACCCCGAGG6CCTAGTGATCCTGGAAGCCAGTGTGAAA 

GACATAGCTGCATTGAATTCCACGCTGGGTTGTTACCGCTACAGCTACGTGGGCCAGGGCCS^GGTCCTCCGGCTG 

AAGGGGCCTGACCACCTGGCCTCCAGCTGCCTGTGGCaCCTGCa.GGGCCCCAAGGACCTCATGCTCS^CTCCGG 

CTGGAGTGGACGCTGGCAGAGTGCCGGGACCGACTGGCCATGTATGACGTGGGCGGGCCCCTGGAGAAGAGGCTC 

ATCACCTCGGTGTACGGCTGCAGCCGCCAGGAGCCCGTG6TGGAGGTTCTGGCGTCGGGGGCCATCATGGCGGTC 

GTCTGGAAGAAGGGCCTGCACAGCTACTACGACCCCTTCGTGCTCTCCGTGCAGCCGGTGGTCTTCCAGGCCTGT 

QAAGTGAACCTGACGCTGQACAACAGGCTCGACTCCCAGGGCGTCCTCAGCACCCCGTACTTCCCCAGCTACTAC 

TC6CCCCAAACCCACTGCTCCTGGCACCTCACGGTGCCCTCTCTGGACTACGGCTTGGCCCTCTGGTTTGATGCC 

TATGCACTGAGGAGGCAQAAGTATGATTTGCCGTGOICCCAGGGCCAGTGGACGATCCAQAACAGGAGGCTGTGT 

GGCTTGCGCATCCTGCAGCCCTACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACC 

TCCCAQATCTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCCCTGGA 

GAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGACTGCCCCAACGGCCTGGAT 

GAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGACAGCACATGCATCTCACTGCCCAAGGTCTGT 

GATGGGCAGCCTGATTGTCTCaACGGCAGCGATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACC 

TTCCAGTGTGAGGACCGGAGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGC 

TCGGATGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGTCCTCCGAG 

GGTQAGTGGCCATGGCAGGCCAGCCTCa^GGTTCGGGGTCGAOkCATCTGTGGGGGGGCCCTCATCGCTGACCGC 

TGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCATGGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGC 

AAGGTGTGGCAGAACTCGCGCTGGCCTGGAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCAC 

GAAGAGGACAGCCATGACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGC 

CCCGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCTGGGGCGCCTTG 

CGCGAGGGCGGCCCCATCTkGCAACGCTCTGCAGAAAGTGGATGTGCAGTTGATCCCACAGGACCT 

GCCTATCGCTACOVGGTGACGCCACGCATGCTGTGTGCCGGCTACCGCAAGGGa^GAAGGATGCCTGTCAGGGT 

GACTCAGGTGGTCCGCTGGTGTGCAAGGCACTCaLGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTG 

GGCTGTGGCCGGCCTAACTACTTCGGCGTCTACykCCCGCATCACAGGTGTGATCAGCTGGATCCaGCy^GT^ 

ACCTGAGGAACTGCCCCCCTGC7U^GC».GGGCCCACCTCCTGGACTCAGAGAGCCa^GGGC»^ 

GGGACAAGTATTCTGGCGGGGGGTGGGGGAGAGAGCAGGCCCTGTGGTGGCAGGAGGTGGCATCTTGTCTCGTCC 

CTGATGTCTGCTCCAGTGATGGCAGGAGGATGGAGAAGTGCCAGCaGCTGGGGGTCAAGACGTCCCCTGAGGACC 

CAGGCCCACACCCAGCCCTTCTGCCTCCCAATTCTCTCTCCTCCGTCCCCTTCCTCCACTGCTGCCTAATGCAAG 

GO^GTGGCTCAGCAGCAAGAATGCTGGTTCTACaTCCCGAGGAGTGTCTGAGGTGCGCCCCACTCTGTACAGA^ 

CTGTTTGGGCAGCCTTGCCTCCAGAGAGCAGATTCCAGCTTCGGAAGCCCCTGGTCTAACTTGGGATCTGGGAAT 

GGAAGGTGCTCCCATCGGAGGGGACCCTCAGAGCCCTGGAGACTGCCAGGTGGGCCTGCTGCCACTGTAAGCCAA 

AAGGTGGGQAAGTCCTGACTCCAGGGTCCTTGCCCCACCCCTGCCTGCCACCTGGGCCCTCACAGCCCAQACCCT 

CACTGGGAGGTGAGCTCAGCTGCCCTTTGGAATAAAGCTGCCTGATCAAAAAAAAAAAAAAAAAAAAA 



FIGURE 63 



></usr/s eqdb 2 / s s t /DNA/Dna seqs . min/ s s . DNA4 9152 
xsubunit 1 of 1, 802 aa, 1 stop 
xMW: 88846, pi: 6.41, NX(S/T): 7 

MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLLALLVLASAGVL 
LWYFLGYKAEVMVSQWSGSLRVLN^FSQDLTRRESSAFRSETAKAQKMLKELITSTRLGT 
YYNSSSVYSFGEGPLTCFFWFILQIPEHRRLMLSPEWQALLVEELLSTVNSSAAVPYRAEY 
EVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLML 
KLRLEWTIAECRDRLAimDVAGPLEKRLITSVYGCSRQEPVVEVLASGAIMAVVWKKGLHSY 
YDPFVLSVQPWFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGLAL 
WFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAGITINFTSQISLTGP 
GVRVHYGLYNQSDPCPGEFLCSVNGLCVPACDGVKDCPNGLDERNCVCRATFQCKEDSTCIS 
LPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCD 
CGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIADRWVITAAHCFQEDSMASTVL 
WTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPWRSAAVRPVCLPA 
RSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCSEAYRYQVTPRMLCAGYRKG 
KKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQWT 

Important features : 

Type II transmembrane domain: 

amino acids 46-67 

Serine proteases, trypsin family, histidine active site, 
amino acids 604-609 
N-glycosylation sites. 

amino acids 127-130, 175-178, 207-210, 329-332, 424-427, 444-447 
and 509-512 
Kringle domains. 

amino acids 746-758 and 592-609 

Homologous region to Kallikrein Light Chain: 

amino acids 568-779 

Homologous region to Low-density lipoprotein receptor: 

amino acids 451-567 



FIGURE 64 



GCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCC 
TACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGAT 
CTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCC 
CTGGAGAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGAC 
TGCCCCAACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGA 
CAGCACATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCG 
ATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGG 
AGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGA 
TGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGT 
CCTCCGAGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGG 
GGGGCCCTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCAT 
GGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTG 
GAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCACGAAGAGGACAGCCAT 
GACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCC 
CGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCT 
GGGGCGCCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTG 
ATCCCACAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGC 
CGGCTACCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCA 
AGGCACTCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGG 
CCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGT 
GACCTGAGGAACTGCCCCCCTGCAAAGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGC 
AACTGCCAAGCAGGGGGACAAGTAT 



FIGURE 65 



GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGG 
CTCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCG 
AGAGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAA 
ACTTCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGGATGAGCAACTCCGTTCCTCTGCTCTG 
TTTCTGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGAC 
GGCTGGAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGG 
TTTAACCTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAG 
CCAGCCCTTAGAAGACTGCAGTTTCAACATGACAGCTAAAACCTTTTTCATCATTCACGGAT 
GGACGATGAGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGA 
GAGAAAGACGCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGA 
TGCGGTCAATAATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGG 
AGAAGGACGATTTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTG 
GCCGGGTATGCAGGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGC 
CGGGCCCATGTTTGAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTG 
TGGATGTCCTCCACACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTG 
GGCCACATTGACATCTACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGT 
CTTGGGATCAATTGCATATGGAACAATCACAGA6GTGGTAAAATGTGAGCATGAGCGAGCCG 
TCCACCTCTTTGTTGACTCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACT 
GACTCCAATCGCTTCAAAAAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCAT 
TGGCTACAATGCCAAGAAAATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGG 
CAGGCATGCCTTTCAGAGGTAACCTTCAGTCCCTGGAGTGTCCCTGAGGAAGGCCCTTAATA 
CCTCCTTCTTAATACCATGCTGCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCA 
CAATCCAATCAAATCGTTGCAAATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTT 
ACAAAATAAACAGTGTGGACCCCTAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 



FIGURE 66 

></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss . DNA49646 
xsubunit 1 of 1, 354 aa, 1 stop 
xMW: 39362, pi: 8.35, NX(S/T): 2 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKIiHKPKATQTEVKPSVRFNLRTSKDPEHE 
GCYLSVGHSQPLEDCSFNMTAKTFFI IHGWTMSGI FENWLHKLVSALHTREKDANWWDWL 
PLMIQLYTDAVmTRWGHSIARMLDWLQEKDDFSLGlSrraLIGYSLGAHVAGYAGNFVKGTV 
GRITGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDF 
QPGCGLNDVLGSIAYGTITEWKCEHERAVHLFVDSLVWQDKPSFAFQCTDSNRFKKGICLS 
CRKNRCNS IGYMAKKMRNKRNSKMYLKTRAGMPPRGNLQSLECP 

Important features: 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 



N-glycosylation sites. 

amino acids 80-83 and 136-139 



FIGURE 67 

CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCCGGGGCGCCGGGCCGAGCCACCTCTTCCCCTCCCCCGC 

TTCCCTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGCTQACAGT 

CGGCaAAGTTTGGCCCGAAGAGGAAGTGGTCTO^CCCCGGCaGGTGGCGACCAGGCCAGACC^ 

CTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAG 

AGAAGAGTGCGGCGGCGGACGGAGAAAACyyiCTCCT^GTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGC 

CCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGGGAGGCGG 

CACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCACAGAAGCTCCGGGACCCTTCCG 

GCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCC^^ 

ACCGGATTATTTTTCGAAATOlTGCTTGTGAGGACCCCCCAGCaGTGCTCTTAGAAGTGCAGGGCACCTTACAGA 

GGCCCCTGGTCCGGGACAGCCGOiLCCTCCCCTGCCAACTGCACCTGGCTCATCCTGGGCy^GCAAG^ 

TCACCATCAGGTTCCAGAAGCTACACCTGGCCTGTGGCTCAGAGCGCTTAACCCTACGCTCCCCrCTCCAGCCAC 

TGATCTCCCTGTGTGAGGCACCTCCCAGCCCTCTGCaGCTGCCCGGGGGCAACGTCACCATCACTTACAGCTATC 

CTGGGGCCAGAGCACCCATGGGCCAGGGCTTCCTGCTCTCCTACAGCCAAGATTGGCrGATGTGCCTGCAa 

AGTTTCAGTGCCTGAACCACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGCGATG6CT 

CTGATGZ^GCaGGTTGCAGCTCAGACCCOTTCCCTGGCCTQACCCCAAGACCCGTCCCCTCCCTGCCTTGCAATG 

TCy^CCTTGGAGGACTTCTATGGGGTCTTCTCCTCTCCTGGATATACACACCTAGCCTCSlGTCTCCCACCCCCAGT 

CCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCTGGCCGTGCGCTTCACAGCCCTGGACTTGGGCTTTG 

GAGATGCaGTGCATGTGTATGACGGCCCTGGGCCCCCTGAGAGCTCCCGACTACTGCGTAGTCTCACCCACTTCA 

GCAATGGC?^GGCTGTCa.CTGTGGAGAC:ACTGTCTGGCCyiGGCTGTTGTGTCCTACCS^CACAG^^ 

ATGGTCGTGGCTTCAATGCCACCTACCATGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCTGTGGCTTAGGCT 

CTGGCCTGGQAGCTGGCGAAGGCCTAGGTGAGCGCTGCTACAGTGAGGCACftGCGCTGTGACGGCTCa^TGGGAOT 

GTGCTGACGGCaCAGATGAGGAGGACTGCCCAGGCTGCCCACCTGGACACTTCCCCTGTGGGGCTGCTGGCACCT 

CTGGTGCCACAGCCTGCTACOTGCCTGCTGACCGCTGC^CTACCAGACTTTCTGTGCTGATGGAGCAGATGAGA 

GACGCTGTCGGCATTGCCAGCCTGGCAATTTCC6ATGCCGGGACGAGAAGTGCGTGTATGAGACGTGGGTGTGCG 

ATGGGCaVGCCaGACTGTGCGGACGGCAGTGATGAGTGGGACTGCTCCTATGTTCTGCCCCGOUlGGTaVTTACy^^ 

CTGCa.GTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGGTCATCGCCCTGGGCTGCACCTGCAAGCTCTATGCCA 

TTCGCACCCAGGAGTACAGCATCTTTGCCCCCCTCTCCCGGATGGAGGCTQAGATTGTGCAGCAGCAGGCACCCC 

CTTCCTACGGGCa.GCTCATTGCCCAGGGTGCCATCCCa.CCTGTAGAAGACTTTCCTACAQAGAATCCTAATGATA 

ACTCAGTGCTGGGCAACCTGCGTTCTCTGCTACAGATCTTACGCCAGGATATGACTCCAGGAGGTGGCCCAGGTG 

CCCGCCGTaSTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCTGGGGCTTGCTCCCTCGAA 

CCAACACCCCGGCTCGGGCCTCTGAGGCCAGATCCCAGGTCACACCTTCTGCTGCTCCCCTTGAGGCCCTAGATG 

GTGGCACAGGTCCaGCCCGTGAGGGCGGGGOlGTGGGTGGGCS^GATGGGGAGCAGGCT^CCCCCaCTGCCC^^ 

AGGCTCCCCTCCCATCTGCTAGCACGTCTCCAGCCCCCACTACTGTCCCTGAAGCCCCAGGGCCACTGCCCTCAC 

TGCCCCTAGAGCCATCACTATTGTCTGGAGTGGTGCAGGCCCTGCGAGGCCGCCTGTTGCCCAGCCTGGGGCCCC 

CS^GGACCaACCCGGAGCCCCCOTGGACCCaVCACAGO^GTCCTGGCCCTGGAAGATGAGGACGATGTGCT^^ 

TGCCaCTGGCTQAGCCGGGGGTGTGGGTAGCTGAGGCAGAGGATGAGCCACTGCTTACCTGAGGGGACCTGGGGG 

CTCTACTGAGGCCTCTCCCCTGGGGGCTCTACTCATAGTGGCACAACCTTTTAGAGGTGGGTCAGCCTCCCCTCC 

ACCACTTCCTTCCCTGTCCCTGGATTTCAGGGACTTGGTGGGCCTCCCGTTGACCCTATGTAGCTGCTATAAAGT 

TAAGTGTCCCTCAGGCAGGGAGAGGGCTCACAGA6TCTCCTCTGTACGTGGCCATGGCCAGACACCCCAGTCCCT 

TCACCACCACCTGCTCCCCACGCCACCaCCATTTGGGTGGCTGTTTTTAAAAAGTAAAGTTCTTAGAGGATCATA 

GGTCTGGACaCTCCyVTCCTTGCCy^CCTCTACCCAAAAGTGGCCTTAAGCACCGGAATGCCaATTAACT^ 

CCCTCCAGCCCCCAAGGGGAGGATTTGGGCaGAACCTGAGGTTTTGCCATCCACAATCCCTCCTACAGGGCCTGG 

CTO^CAAAAAQAGTGCaACS^TGCTTCTATTCCATAGCTACGGCATTGCTCAGTAAGTTGAGGTCa^^ 

GGAATCATACATCTC 



FIGURE 68 



< / usr/ seqdb2 / sst /DNA/Dnaseqs . min/ ss . DNA4 9631 
<subunit 1 of 1, 713 aa, 1 stop 
<MW: 76193, pi: 5.42, NX(S/T): 4 

MLLATLLLLLLGGALAHPDRI I FPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWLIL 
GSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAGARAP 
MGQGFLLSYSQDWLMCLQEEFQCLNHRCVSAVQRCDGVDACGDGSDEAGCSSDPFPGLTPRP 
VPS LPCNVTLEDF YGVFS S PGYTHLAS VSHPQSCHWLLDPHDGRRLAVRFTALDLGFGDAVH 
VYDGPGPPESSRLLRSLTHFSNGKAVTVETLSGQAWSYHTVAWSNGRGFNATYHVRGYCLP 
WDRPCGLGSGLGA6EGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGHFPCGAAGTSGAT 
ACYLPADRClSryQTFCADGADERRCRHCQPGNFRCRDEKCVYETWVCDGQPDCADGSDEWDCS 
YVLPRKVITAAVIGSLVCGLLLVIALGCTCKLYAIRTQEYSIFAPLSRMEAEIVQQQAPPSY 
GQLIAQGAIPPVEDFPTENPNDNSVLGNLRSLLQILRQDMTPGGGPGARRRQRGRLMRRLVR 
RLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAREGGAVGGQDGEQAPPLPIKA 
PLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRGRLLPSLGPPGPTRSPPGPHTAV 
LALEDEDDVLLVPU^PGVWVAEAEDEPLLT 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 442-462 

LDL-receptor class A (LDLRA) domain proteins 

amino acids 411-431, 152-171, 331-350 and 374-393 



FIGURE 69 

CGAGCTGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGCGGTTGCTATCGCTTCG 
CAGAACCTACTCAGGCAGCCAGCTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCA 
GACGCGATGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGG 
CCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACCTTTTTTATCATCGCAC 
AAGCCCCTGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATA 
CTTTTATATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATAT 
TATCAACTCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAG 
AAACCACAACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTCTT 
GCCGACGGGGCCCTTATTTACCGGAAGCTTCTGTTCAATCCCAGCGGTCCTTACCAGAAAAA 
GCCTGTGCATGAAAAAAAAGAAGTTTT GTAAT TTTATATTACTTTTTAGTTTQATACTAAGT 
ATTAAACATATTTCTGTATTCTTCCAAAAAAAAAAAAAAAAAA 



FIGURE 70 



></usr/seqdb2/sst/DNA/Dnasegs .tnin/ss .DNA49645 
xsiibunit 1 of 1, 152 aa, 1 stop 
xMW: 17170, pi: 9.62, NX(S/T): 1 

MDNVQPKIKHRPFCFSVKGHVKMLRIALTVTSMTFFIIAQAPEPYIVITGFEVTVILFFILL 
YVLRLDRLMKWLFWPLLDIINSLVTTVFMLIVSVIJyiilPETTTLTVGGGVFALVTAVCCLAD 
GALIYRKLLFNPSGPYQKKPVHEKKEVL 

Important features: 

Potential type II transmembrane domain: 

amino acids 26-42 

Other potential transmembrane domain: 

amino acids 44-65, 81-101 and 109-129 

Leucine zipper pattern 

amino acids 78-99 and 85-106 

N-myristoylation site. 

amino acids 110-115 

Ribonucleotide reductase large sxibunit protein 

amino acids 116-127 



FIGURE 71 



GGGCGAGAAGTAGGGGAGGGCGTGTTCCGCCGCGGTGGCGGTTGCTATCGTTTTGCAGAACC 
TACTCAGGCAGCCAGNTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCAGACGCGA 
TGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGGCCACGTG 
AAGATGCTGCGGCTGGCACTAACTGNGACATCTATGACCTTTTTTATNATCGCACAAGCCCC 
TGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATACTTTTAT 
ATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATATTATCAAC 
TCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAGAAACCAC 
AACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTNTTGCCGAC 



FIGURE 72 



CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGG 
CTGQGCGCGCCCCCCGGGCCCCGCCGTGGGC ATGG GCGCACTGGCCCGGGCGCTGCTGCTGC 
CTCTGCTGGCCCAGTGGCTCCTGCGCGCCGCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTG 
CCCCTCCGGGTGGCCGCGGCCACGAACCGCGTA6TTGCGCCCACCCCGGGACCCGGGACCCC 
TGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 
GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTAC 
CTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAG 
TAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 
CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACG 
GGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAA 
CATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATAC 
TTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCC 
CTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCC 
CGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGT 
ATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTG 
AAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGC 
CATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGG 
AAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAG 
CTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACCT 
GAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTCAGC 
CCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAAT 
GCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAA 
GAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTT 
CCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAG 
CCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTT 
AATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCA 
ATGATGAGTCCTCTCTGGTCAGACATCGCTGGAA ATGAA TAGCCAGGCCTGACCTCAAGCAA 
CCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGCAGCAGCCGGGATCGATGGTGGCG 
CTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGATTCAC 
TGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAGAAAAATAATTAA 
AAAAAAAACTTCATTCTAA 



iilHlllll Hill I II 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA45493 
xstibunit 1 of 1, 518 aa., 1 stop 
xMW: 56180, pi: 5.08, NX(S/T) : 2 

MGALARALLLPLLAQWLLRAAPEI^PAPFTLPLRVAAATimWAPTPGPGTPAERHADGLMi 
ALEPALASPAGAANFLAMVDNLQGDSGRGyyLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS 
YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENF 
FLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGS 
LVLGGIEPSLYKGDIWYTPIKEEPJYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLR 
LPQKVFDAWEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKI S lYLRDENSSRSFR 
ITILPQLYIQPMMGAGIjIO'ECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCA 
EIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRC 
QRRPRDPEWNDESSLVRHRWK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 466-494 

N-glycosylation sites. 

amino acids 170-173 and 366-369 

Leucine zipper pattern. 

amino acids 10-31 and 197-118 

Eukaryotic and viral aspartyl proteases 

amino acids 109-118, 252-261 and 298-310 



FIGURE 74 



CGCCTCCGCCTTCGGAGGCTGACGCGCCCGGGCGCCGTTCCAGGCCTGTGCAGGGCGGATCG 
GCAGCCGCCTGGCGGCGATCCAGGGCGGTGCGGGGCCTGGGCGGGAGCCGGGAGGCGCGGCC 
GGCATGGAGGCGCTGCTGCTGGGCGCGGGGTTGCTGCTGGGCGCTTACGTGCTTGTCTACTA 
CAACCTGGTGAAGGCCCCGCCGTGCGGCGGCATGGGCAACCTGCGGGGCCGCACGGCCGTGG 
TCACGGGCGCCAACAGCGGCATCGGAAAGATGACGGCGCTGGAGCTGGCGCGCCGGGGAGCG 
CGCGTGGTGCTGGCCTGCCGCAGCCAGGAGCGCGGGGAGGCGGCTGCCTTCGACCTCCGCCA 
GGAGAGTGGGAACAATGAGGTCATCTTCATGGCCTTGGACTTGGCCAGTCTGGCCTCGGTGC 
GGGCCTTTGCCACTGCCTTTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGCC 
GGTATCAGTTCCTGTGGCCGGACCCGTGAGGCGTTTAACCTGCTGCTTCGGGTGAACCATAT 
CGGTCCCTTTCTGCTGACACATCTGCTGCTGCCTTGCCTGAAGGCATGTGCCCCTAGCCGCG 
TGGTGGTGGTAGCCTCAGCTGCCCACTGTCGGGGACGTCTTGACTTCAAACGCCTGGACCGC 
CCAGTGGTGGGCTGGCGGCAGGAGCTGCGGGCATATGCTGACACTAAGCTGGCTAATGTACT 
GTTTGCCCGGGAGCTCGCCAACCAGCTTGAGGCCACTGGCGTCACCTGCTATGCAGCCCACC 
CAGGGCCTGTGAACTCGGAGCTGTTCCTGCGCCATGTTCCTGGATGGCTGCGCCCACTTTTG 
CGCCCATTGGCTTGGCTGGTGCTCCGGGCACCAAGAGGGGGTGCCCAGACACCCCTGTATTG 
TGCTCTACAAGAGGGCATCGAGCCCCTCAGTGGGAGATATTTTGCCAACTGCCATGTGGAAG 
AGGTGCCTCCAGCTGCCCGAGACGACCGGGCAGCCCATCGGCTATGGGAGGCCAGCAAGAGG 
CTGGCAGGGCTTGGGCCTGGGGAGGATGCTGAACCCGATGAAGACCCCCAGTCTGAGGACTC 
AGAGGCCCCATCTTCTCTAAGCACCCCCCACCCTGAGGAGCCCACAGTTTCTCAACCTTACC 
CCAGCCCTCAGAGCTCACCAGATTTGTCTAAGATGACGCACCGAATTCAGGCTAAAGTTGAG 
CCTGAGATCCAGCTCTCCTAACCCTCAGGCCAGGATGCTTGCCATGGCACTTCATGGTCCTT 
GAAAACCTCGGATGTGTGTGAGGCCATGCCCTGGACACTGACGGGTTTGTGATCTTGACCTC 
CGTGGTTACTTTCTGGGGCCCCAAGCTGTGCCCTGGACATCTCTTTTCCTGGTTGAAGGAAT 
AATGGGTGATTATTTCTTCCTGAGAGTGACAGTAACCCCAGATGGAGAGATAGGGGTATGCT 
AGACACTGTGCTTCTCGGAAATTTGGATGTAGTATTTTCAGGCCCCACCCTTATTGATTCTG 
ATCAGCTCTGGAGCAGAGGCAGGGAGTTTGCAATGTGATGCACTGCCAACATTGAGAATTAG 
TGAACTGATCCCTTTGCAACCGTCTAGCTAGGTAGTTAAATTACCCCCATGTTAATGAAGCG 
GAATTAGGCTCCCGAGCTAAGGGACTCGCCTAGGGTCTCACAGTGAGTAGGAGGAGGGCCTG 
GGATCTGAACCCAAGGGTCTGAGGCCAGGGCCGACTGCCGTAAGATGGGTGCTGAGAAGTGA 
GTCAGGGCAGGGCAGCTGGTATCGAGGTGCCCCATGGGAGTAAGGGGACGCCTTCCGGGCGG 
ATGCAGGGCTGGGGTCATCTGTATCTGAAGCCCCTCGGAATAAAGCGCGTTGACCGCCAAAA 
AAAAAAAAAAAAAAAAA 



FIGURE 75 

< / usr / seqdb2 / s s t /DNA/Dna seqs . min/ s s . DNA4 8227 
<subunit 1 of 1, 377 aa, 1 stop 
<MW: 40849, pi: 7.98, NX(S/T): 0 

MEALLLGAGLLLGAYVLVYYNLVKAPPCGGMGNLRGRTAWTGJ^SGIGKMTALELARRGAR 
WLACRSQERGEAAAFDLRQESGNNEVIFMALDLASLASVRAFATAFLSSEPRLDILIHNAG 
ISSCGRTREAFNLLLRVNHIGPFLLTHLLLPCLKACAPSRVVWASAAHCRGRLDFKRLDRP 
WGWRQELRAYADTKLANVLFARELANQLEATGVTCYAAHPGPVNSELFLRHVPGWLRPLLR 
PLAWLVLRAPRGGAQTPLYCALQEGIEPLSGRYFANCHVEEVPPAARDDRAAHRLWEASKRL 
AGLGPGEDAEPDEDPQSEDSEAPSSLSTPHPEEPTVSQPYPSPQSSPDLSKMTHRIQAKVEP 
EIQLS 

Important features: 
Signal peptide: 

amino acids 1-16 

Glycosaminoglycan attachment site. 

amino acids 46-49 

Short-chain alcohol dehydrogenase family 

amino acids 37-49 and 114-124 



FIGURE 76 



GGAGGAGACAGCCTCCTGGGGGGCAGGGGTTCCCTGCCTCTGCTGCTCCTGCTCATCaTGGGAGGC^^ 

GACTCCCCGCCCCAGATCCTAGTCCACCCCCAGGACCAGCTGTTCCAGGGCCCTGGCCCTGCCAGGATGAGCTGC 

CAAGCCTCAGGCCAGCCACCTCCCACCATCCGCTGGTTGCTGaATGGGCAGCCCCTGAGCATGGTGCCCCCAGA^ 

CCACaCCACCTCOTGCCTGATGGGACCCTTCTGCTGCTACAGCCCCCTGCCCGGGGACATGCCCACGATGGCCAG 

GCCCTGTCCACAQACCTGGGTGTCTACACATGTGAGGCCAGCAACCGGCTTGGCACGGGAGTCAGCAGAGGC^^ 

CGGCTGTCTGTGGCTGTCCTCCGGGAGGATTTCCAGATCCAGCCTCGGGACATGGTGGCTGTGGTGGGTGAGCAG 

TTTACTCTGQAATGTGGGCCGCCCTGGGGCCACCCAGAGCCCACAGTCTCATGGTGGAAAGATGGGAAACCCCTG 

GCCCTCCAGCCCGGAAGGCACACAGTGTCCGGGGGGTCCCTGCTGATGGCAAGAGCAGAGAAQAGTGACGAAGGG 

ACCTACATGTGTGTGGCCACCAACAGCGCAGGACATAGGGAGAGCCGCGCAGCCCGGGTTTCCATCCAGQAGCCC 

CAGGACTACACGQAGCCTGTGGAGCTTCTGGCTGTGCGAATTCAGCTGGAAAATGTGACACTGCTGAACCCGGAT 

CCTGCAGAGGGCCCCAAGCCTAGACCGGCGGTGTGGCTCAGCTGGAAGGTCAGTGGCCCTGCTGCGCCTGCCCAA 

TCTTACACGGCCTTGTTCAGGACCCAGACTGCCCCGGGAGGCCAGGGAGCTCCGTGGGCAGAGGAGCTGCTGGCC 

GGCTGGCAGAGCGCAGAGCTTGGAGGCCTCCACTGGGGCCAAGACTACGAGTTCAAAGTGAGACCATCCTCTGGC 

CGGGCTCGAGGCCCTGACa.GCAACGTGCTGCTCCTGAGGCTGCCGGAAAAAGTGCCCAGTGCCCCACCTCAGGAA 

GTGACTCTAAAGCCTGGCAATGGCACTGTCTTTGTGAGCTGGGTCCCACCACCTGCTGAAAACCACAATGGCATC 

ATCCGTGGCTACCAGGTCTGGAGCCTGGGCaLACACATCACTGCCACCAGCCaAOTGGACTC^ 

ACCCaGCTGGAAATCGCCACCCATATGCCAGGCTCCTACTGCGTGCAAGTGGCTGCAGTCACTGGTGCTGGAGCT 

GGGGAGCCCAGTAGACCTGTCTGCCTCCTTTTAGAGCAGGCCATGGAGCGAGCCACCCAAGAACCCAGTGAGCAT 

GGTCCCTGGACCCTGGAGCAGCTGAGGGCTACCTTGAAGCGGCCTGAGGTCATTGCCACCTGCGGTGTTGCACrC 

TGGCTGCTGCTTCTGGGCACCGCCGTGTGTATCCACCGCCGGCGCCGAGCTAGGGTGCACCTGGGCCCAGGTCTG 

TACaQATATACCAGTGAGGATGCCATCCTAAAACACAGGATGGATCaCAGTGACrCCCAGTGGTTGGa^GA(^^ 

TGGCGTTCCACCTCTGGCTCTCGGGACCTGAGOlGCAGCAGCAGCOTCyiGCAGTCGGCTGGGGGCGGATGCCCG^ 

GACCCACTAGACTGTCGTCGCTCCTTGCTCTCCTGGGACTCCCGAAGCCCCGGCGTGCCCCTGCTTCCAGACACC 

AGOiCTTTTTATGGCTCCCTaVTCGCTGAGCTGCCCTCCAGTACCCCAGCCAGGCCAAGTCCCCAGGTCCCAGCT 

GTCaiGGCGCCTCCCACCCCAGCTGGCCCAGCTCTCCAGCCCCTGTTCCAGCTCAGACAGCCTCTGCAGCCGCAGG 

GGACTCTCTTCTCCCCGCTTGTCTCTGGCCCCTGCAGAGGCTTGGAAGGCCAAAAAGAAGCAGGAGCTGCAGCAT 

GCCAACAGTTCCCCACTGCTCCGGGGCAGCCACTCCTTGGAGCTCCGGGCCTGTGAGTTAGGAAATAGAGGTTCC 

AAGAACCTTTCCCAAAGCCCAGGAGCTGTGCCCCAAGCTCTGGTTGCCTGGCGGGCCCTGGGACCGAAACTCCTC 

AGCTCCTCAAATGAGCTGGTTACTCGTCATCTCCCTCCAGCACCCCTCTTTCCTCATGAAACTCCCCCAACTCAG 

AGTCAACAGACCCAGCCTCCGGTGGCACCACaGGCTCCCTCCTCCATCCTGCTGCCAGCAGCCCCCATCCCCATC 

CTTAGCCCCTGCAGTCCCCCTAGCCCCCAGGCCTCTTCCCTCTCTGGCCCCAGCCCAGCTTCCAGTCGCCTGTCC 

AGCTCCTCACTGTCATCCCTGGGGGAGGATCAAGACAGCGTGCTGACCCCTGAGGAGGTAGCCCTGTGCTTGGAA 

CTCAGTGAGGGTQAGGAGACTCCCAGQAACAGCGTCTCTCCCATGCCAAGGGCTCCTTCACCCCCCACCACCTAT 

GGGTACATCAGCGTCCO^CAGCOTCAQAGTTCACGGACATGGGCAGGACTGGAGGAGGGGTGGGGCCCAAGGGG 

GGAGTCTTGCTGTGCCCACCrCGGCCCTGCCTCACCCCCACCCCCAGCGAGGGCTCCTTAGCCAATGGTTGGGGC 

TCAGCCTCTGAGGACAATGCCGCCAGCGCCAGAGCCAGCCTTGTCAGCTCCTCCGATGGCTCCTTCCTCGCTGAT 

GCTCACTTTGCCCGGGCCCTGGCAGTGGCTGTGGATAGCTTTGGTTTCGGTCTAGAGCCCAGGGAGGCAGACTGC 

GTCTTCATAQATGCCTCATCACCTCCCTCCCCACGGGATGAGATCTTCCTGACCCCCAACCTCTCCCTGCCCCTG 

TGGGAGTGGAGGCCAGACTGGTTGGAAGACATGGAGGTCAGCCACACCCAGCGGCTGGGAAGGGGGATGCCTCCC 

TGGCCCCCTGACTCTCAGATCTCTTCCCAGAGAAGTCAGCTCCACTGTCGTATGCCCAAGGCTGGTGCTTCTCCT 

GTAGATTACTCCTGAACCGTGTCCCTGAGACTTCCCAGACGGGAATa^GAACCACTTCTCCTGTCCaCCCAC^ 

ACCTGGGCT6TGGTGTGTGGGTCTTGGCCTGTGTTTCTCTGCAGCTGGGGTCCACCTTCCCAAGCCTCCAGAGAG 

TTCTCCCTCOiCGATTGTGAAAACAAATGAAAACAAAATTAGAGCAAAGCTGACCTGGAGCCCTCAGGGAGCAAA 

ACATCATCTCCACCTGACTCCTAGCCAOTGCTTTCTCCTCTGTGCCATCCACTCCCACCACCaGGTTGTra 

CTGAGGAGCAGCCCTGCCTGCTGCTCTTCCCCCACCATTTGGATCACAGGAAGTGGAGGAGCCAGAGGTGCCTTT 

GTGGAGGACAGCAGTGGCTGCTGGGAGAGGGCTGTGGAGGAAGGAGCTTCTCGGAGCCCCCTCTCAGCCTTACCT 

GGGCCCCTCCTCTAGAGAAGAGCTCAACTCTCTCCCAACCTCACCATGGAAAGAAAATAATTATGAATGCCACTG 

AGGCACTGAGGCCCTACCTCATGCCaAACAAAGGGTTCaAGGCrGGGTCTAGCGAGGATGCTGAAGGAAGGGAC^ 

TATGAGACCGTAGGTCAAAAGCACCATCCTCGTACTGTTGTCACTATGAGCTTAAGAAATTTGATACCATAAAAT 

GGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 77 



< /us r / s eqdb2 / s s t / DNA/Dna seqs . min/ s s . DNA4 1404 
<subunit 1 of 1, 985 aa, 1 stop 
<MW: 105336, pi: 6.55, KX(S/T): 7 

MGGMAQDS PPQI LVHPQDQLFQGPGPARMSCQASGQPPPTIRWLLNGQPLSMVPPDPHHLLP 
DGTLLLLQPPARGHAHDGQALSTDLGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDM 
VAWGEQFTLECGPPWGHPEPTVSWWKDGKPLALQPGRHTVSGGSLLMARAEKSDEGTYMCV 
ATNSAGHRESRAARVSIQEPQDYTEPVELLAVRIQLENVTLLNPDPAEGPKPRPAVWLSWKV 
SGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEFKVRPSSGRARG 
PDS]Sr\nJLLRLPEKVPSAPPQEVTLKPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPP 
ANWTWGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMSRATQEPSEHGPW 
TLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMD 
HSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTST 
FYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSSPCSSSDSLCSRRGLSSPRLSLAPAEA 
WKAKKKQELQHANSSPLLRGSHSLELRACELGNRGSKNLSQSPGAVPQALVAWRALGPKLLS 
SSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPSSILLPAAPIPILSPCSPPSPQASS 
LSGPSPASSRLSSSSLSSLGEDQDSVLTPEEVALCLELSEGEETPRNSVSPMPRAPSPPTTY 
GYISVPTASEFTDMGRTGGGVGPKGGVLLCPPRPCLTPTPSEGSLANGWGSASEDNAASARA 
SLVSSSDGSFLADAHFARALAVAVDSFGFGLEPREADCVFIDASSPPSPRDEIFLTPNLSLP 
LWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQLHCRMPKAGASPVDYS 

Important features: 
Transmembrane domain: 
amino acids 44 8-467 
N-glycosylation sites: 

amino acids 224-227, 338-341, 367-370, 374-377, 658-661 and 926- 
929 

N-myristoylation sites. 

amino acids 47-52, 80-85, 88-93, 99-104, 105-110, 181-186, 272- 
277, 290-295, 355-360, 403-408, 462-467, 561-566, 652-657, 849- 
854 and 876-881 

Phosphotyrosine interaction domain proteins 

amino acids 740-753 



FIGURE 78 

CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT 
CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 
CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 
GGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 
GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 
ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 
TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCAC 
CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 
CAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 
GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 
AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 
TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGT6GCAGCTCTAAGCCCAGG 
GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 
CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 
CGGAGACACAGAGGAACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCC 
CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 
GCTGGGCTTCTCGAAGTTTGTCTCAGCGTAGGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 
GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 
TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 
GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 
GAGACTGGGACATCCCTGATAGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCA 
GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 
CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 
TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 
GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 
GTGAAAAACGTGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAG 
GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 
ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCC 
CAATAGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 
TCCAGGCCTTGGTCAGGTCAGGTGCACATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 
AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 
GAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACAGGGCGG 
ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 
GCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGAATCTCGTGCT 
GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 
CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 
TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 
CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 
TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 
ATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCATAGTCTCACCAGAGACTATCAT 
TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 
TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 
ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 
GCTGCATAAAAAAAAAAAAAA 



FIGURE 7 9 

< / us r/ seqdb2 / s s t /DNA/ Dnaseqs . min/ s s . DNA4 4196 
<subunit 1 of 1, 332 aa, 1 stop 
<MW: 36143, pi: 5.89, NX(S/T): 1 

MRLLVLLWGCLLLPGYEALEGPEE I SGFEGDTVSLQCTYREELRDHRKYWCRKGGI LFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 
FPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG 
TSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 
LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site. 

amino acids 96-99 

Fibrinogen beta and gamma chains C- terminal domain. 

amino acids 104-113 



Ig like V-type domain: 

amino acids 13-128 



FIGURE 80 



TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGA 
GCCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTC 
GCTCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCT 
GGACTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAA 
GCCCTGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAA 
AGCTCAGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAG 
CAGGGCTCTCAGAAGGCGGTGGTGCCCAGCTGGGATCATGTTGTTGGCCCTGGTCTGTCTGC 
TCAGCTGCCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTG 
CTACATGACTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGC 
TTATTTCACAAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACA 
ACGGGATCTTCCAGATCAACAGCCGGAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAAC 
GTGTGCCGGATGTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGC 
CAT6AAGATAACCCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCC 
AGGGAAAAGACCTCACTGAATGGGTGGATGGCTGTGACTTCTAGGATGGACGGAACCATGCA 
CAGCAGGCTGGGAAATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATA 
AAGGATGGTTGAACGTGAAA 



FIGURE 81 

< /us r / seqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA5 2187 
<siibunit 1 o£ 1, 146 aa, 1 stop 
<MW: 16430, pi: 5.05, NX{S/T): 1 

MLLALVCLLSCLLPSSEAKI.YGRCELARVLHDFGLDGYRGYSLADWVCLAyFTSGFNAAALD 
YEADGSTNNGIPQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGY 
WEAWRHHCQGKDLTEWVDGCDF 

Iirtportant features: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 67-72 

Homolgous region to Alpha- lac talbixmin / lysozyme C proteins. 

amino acids 34-58 (catalytic domain) , 111-132 and 66-107 



FIGURE 82 



AGCCGCTGCCCCGGGCCGGGCGCCCGCGGCGGCACC ATGA GTCCCCGCTCGTGCCTGCGTTC 
GCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGCGAGCAACTGGCTGTACCTGGCCA 
AGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTGCGAGAAACTCAAGGGCCTGATC 
CAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCATGGACTCGGTGCGCCGCGGTGC 
CCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCGGCGCTGGAACTGCTCCACAC 
TCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGACTCGGGAGGCGGCCTTCGTG 
TACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGTGCAGCAGTGGGGAGCT 
GGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTCCAGTGGTCAG 
GATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAG 
AGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACAATGAGGCCGGCAG 
G2VAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTCCTGTG 
AGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACGCACTGAAGGAG 
AAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGGCACTGGTACC 
ACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACCTGGTGTACTTGGAGCCTAGCCCCG 
ACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACATGCAACAAG 
ACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACACGGCGCA 
GGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTGCCGGC 
AGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATGACCGCCTGCCTAGCCCTGCGCCGGC 
AACCACCTAGTGGCCCAGGGAAGGCCGATAATTTAAACAGTCTCCCACCACCTACCCCAAGA 
GATACTGGTTGTATTTTTTGTTCTGGTTTGGTTTTTGGGTCCTCATGTTATTTATTGCCGAA 
ACCAGGCAGGCAACCCCAAGGGCACCAACCAGGGCCTCCCCAAAGCCTGGGCCTTTGTGGCT 
GCCACTGACCAAAGGGACCTTGCTCGTGCCGCTGGCTGCCCGCATGTGGCTGCCACTGACCA 
CTCAGTTGTTATCTGTGTCCGTTTTTCTACTTGCAGACCTAAGGTGGAGTAACAAGGAGTAT 
TACCACCACATGGCTACTGACCGTGTCATCGGGGAAGAGGGGGCCTTATGGCAGGGAAAATA 
GGTACCGACTTGATGGAAGTCACACCCTCTGGAAAAAAGAACTCTTAACTCTCCAGCACACA 
TACACATGGACTCCTGGCAGCTTGAGCCTAGAAGCCATGTCTCTCAAATGCCCTGAGAAAGG 
GAACAAGCAGATACCAGGTCAAGGGCACCAGGTTCATTTCAGCCCTTACATGGACAGCTAGA 
GGTTCGATATCTGTGGGTCCTTCCAGGCJy^GAAGAGGGAGATGAGAGCAAGAGACGACTGAA 
GTCCCACCCTAGAACCCAGCCTGCCCCAGCCTGCCCCTGGGAAGAGGAAACTTAACCACTCC 
CCAGACCCACCTAGGCAGGCATATAGGCTGCCATCCTGGACCAGGGATCCCGGCTGTGCCTT 
TGCAGTCATGCCCGAGTCACCTTTCACAGCGCTGTTCCTCCATGAAACTGAAAAACACACAC 
ACACACACACACACACACACACACACACACACACACACGGACACACACACACACCTGCGAGA 
GAGAGGGAGGAAAGGGCTGTGCCTTTGCAGTCATGCCCGAGTCACCTTTCACAGCACTGTTCCTC 



FIGURE 83 

< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 8328 
<siibunit 1 of 1, 351 aa, 1 stop 
<MW: 39052, pi: 8.97, NX(S/T): 2 

MSPRSCLRSLRLLVFAVFSAAASNWLYLAKLSSVGSISEEETCEKLKGLIQRQVQMCKRNLE 
VMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKWTQGTREAAFVYAISSAGVAFAV 
TRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRERSKGASSSRALM 
NLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRR 
VGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTSKAIDGCELLC 
CGRGFHTAQVEIAERCSCKFHWCCFVKCRQCQRLVELHTCR 

Important features: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 88-91 and 297-300 

Wnt-1 family signature. 

amino acids 206-215 

Homologous region to Wnt-1 family proteins 

amino acids 183-235, 305-350, 97-138, 53-92 and 150 -174 



FIGURE 84 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCAT 
CGCCATGGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGC 
CCTGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTC 
ACCACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGC 
GGCGCTGCTTGACGGCCACGACCTGCTGAGGACAAAC6CCTCGAAGCAGACGGCGGCGCTGG 
GTGCCCTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTG 
CAGACCACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCT 
GCGGGAACTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACG 
TCCGCACTGAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCG 
TGCCCCACGTCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGAC 
GTGGGCGGCGGCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCC 
TGGATGAGCAGGGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGG 
GCTGTGCGCCATCTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTT 
CAGCCACTGGAACCAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGC 
TGCACACGGGGCTGTGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAG 
AAAAGGCACAACTGCTGACCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTA 



CATCCACCGCTGCTGAGTCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCC 
TGGGCTCTGGGACCTCCATGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACC 
TCCACTAGCTCCAAAATCCCTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAA 
CCAAGGTTAGGTGACTGAGGACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGA 
AGCTGTTTTTGCAGCCTGAGGAAGCATCAATAAATATTTGAGAAATGAAAAAA 



FIGURE 85 

</usr/ seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56352 
<sijLbunit 1 of 1, 293 aa, 1 stop 
<MW: 32562, pi: 6.53, NX(S/T) : 2 

MDTTRYSKWGGSSEEVPGGPWGRWVHWSRRPLFLALAVLVTTVLWAVILSILLSKASTERAA 
LLDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALR 
ELRERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTW 
AAAQDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFS 
HWNQGEPNDAWGRENCVMMLHTGLWNDAPCDSEKDGWICEKRHNC 

Important features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites. 

amino acids 73-76 and 159-162 

Leucine zipper pattern. 

amino acids 102-123 

N-myristoylation sites. 

amino acids 18-23, 133-138 and 242-247 



C-type lectin domain signature. 

amino acids 264-287 
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GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAGCG 
GCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTCCTTTCTCCCACGTCCTATCTGCCTCTCG 
CTGGAGGCCAGGCCGTGCAGCATCGAAGACAGGAGGAACTGGAGCCTCATTGGCCGGCCCGG 
GGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCGCTGCCGGC 
CGCGCTCCCGCTGCTCCTGCCGGGTGATGGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAG 
GCCCTCTGCGCTCTCCTCCTGGCCACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTC 
CATCTGTTCCGCCAGAGCCCCGGCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGA 
CGGCCTTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGG 
GCCGCGCATAGCTCCGACTACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCG 
CGACTTTGCGGAGCGCGGCGAGGCCTGGGCGCTGATGAAGGAGATCGAGGCGGCGGGGGAGG 
CGCTGCAGAGCGTGCACGAGGTGTTTTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACG 
TCGGCGGAGCTGGAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCC 
CAGCCCCGACTGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGG 
AACAGGCGGCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCC 
TCCCCCAACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCA6 
CCACCCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGA 
CACTGCTGCGGCTGCGACAGAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 
AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTCTC 
CCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGCAGGA 
CTCGCTACGTCCGGGTCCAGCCCGCCAACAACGGGAGCCCCTGCCCCGAGCTCGAAGAAGAG 
GCTGAGTGCGTCCCTGATAACTGCGTCTAAGACCAGAGCCCCGCAGCCCCTGGGGCCCCCCG 
GAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGAGGGCACAGGG 
GGTTTCGCGCTGCTCCTGACCGCGGTGAGGCCGCGCCGACCATCTCTGCACTGAAGGGCCCT 
CTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACCTTGCTTCTTAGGGG 
CCCCCGTGTCCCGTCTGCTCTCAGCCTCCTCCTCCTGCAGGATAAAGTCATCCCCAAGGCTC 
CAGCTACTCTAAATTATGTCTCCTTATAAGTTATTGCTGCTCCAGGAGATTGTCCTTCATCG 
TCCAGGGGCCTGGCTCCCACGTGGTTGCAGATACCTCAGACCTGGTGCTCTAGGCTGTGCTG 
AGCCCACTCTCCCGAGGGCGCATCCAAGCGGGGGCCACTTGAGAAGTGAATAAATGGGGCGG 
TTTCGGAAGCGTCAGTGTTTCCATGTTATGGATCTCTCTGCGTTTGAATAAAGACTATCTCT 
GTTGCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 87 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA53971 
xsubiinit 1 of 1, 331 aa, 1 stop 
xMW: 35844, pi: 5.45, NX(S/T): 2 

MENPSPAAALGKALCALLLATLGAAGQPLGGESICSARAPAKYSITFTGKWSQTAFPKQYPL 
FRPPAQWSSLLGAAHSSDYSMWRKNQYVSNGLRDFAERGEAWALMKEIEAAGEALQSVHEVF 
SAPAVPSGTGQTSAELEVQRRHSLVSFWRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYP 
YDAGTDSGFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARVTLLRLRQSP 
RAFIPPAPVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRLGTKSRTRYVRVQPA 
mUGS PCPELEEEAECVPDNCV 

Important features: 
Signal peptide; 

amino acids 1-26 
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GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGTGA 
TATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGGCGGA 
CAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGTACTGAC 
CAAGATGAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATTGAAAACTA 
CAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTGCAGAAATTTT 
ATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGTTGTGCGTGGTTT 
CATAGTCCAAGGCGGAGATCCTACTGGCACAGGGAGTGGTGGAGAGTCTATCTATGGAGCGC 
CATTCAAAGATGAATTTCATTCACGGTTGCGTTTTAATCGGAGAGGACTGGTTGCCATGGCA 
AATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACTGGGTCGAGCAGATGAACT 
TAACAATAAGCATACCATCTTTGGAAAGGTTACAGGGGATACAGTATATAACATGTTGCGAC 
TGTCAGAAGTAGACATTGATGATGACGAAAGACCACATAATCCACACAAAATAAAAAGCTGT 
GAGGTTTTGTTTAATCCTTTTGATGACATCATTCCAAGGGAAATTAAAAGGCTGAAAAAAGA 
GAAACCAGAGGAGGAAGTAAAGAAATTGAAACCCAAAGGCACAAAAAATTTTAGTTTACTTT 
CATTTGGAGAGGAAGCTGAGGAAGAAGAGGAGGAAGTAAATCGAGTTAGTCAGAGCATGAAG 
GGCAAAAGCAAAAGTAGTCATGACTTGCTTAAGGATGATCCACATCTCAGTTCTGTTCCAGT 
TGTAGAAAGTGAAAAAGGTGATGCACCAGATTTAGTTGATGATGGAGAAGATGAAAGTGCAG 
AGCATGATGAATATATTGATGGTGATGAAAAGAACCTQATGAGAGAAAGAATTGCCAAAAAA 
TTAAAAAAGGACACAAGTGCGAATGTTAAATCAGCTGGAGAAGGAGAAGTGGAGAAGAAATC 
AGTCAGCCGCAGTGl^GAGCTCAGAAAAGAAGCAAGACAATTAAAACGGGAACTCTTAGCAG 
CAAAACAAAAAAAAGTAGAAAATGCAGCAAAACAAGCAGAAAAAAGAAGTGAAGAGGAAGAA 
GCCCCTCCAGATGGTGCTGTTGCCGI^TACAGAAGAGAAAAGCAAAAGTATGAAGCTTTGAG 
GAAGCAACAGTCAAAGAAGGGAACTTCCCGGGAAGATCAGACCCTTGCACTGCTGAACCAGT 
TTAAATCTAAACTCACTCAAGCAATTGCTGAAACACCTGAAAATGACATTCCTGAAACAGAA 
GTAGAAGATGATGAAGGATGGATGTCACATGTACTTCAGTTTGAGGATAAAAGCAGAAAAGT 
GAAAGATGCAAGCATGCAAGACTCAGATACATTTGAAATCTATGATCCTCGGAATCCAGTGA 
ATAAAAGAAGGAGGGAAGAAAGCAAAAAGCTGATGAGAGAGAAAAAAGAAAGAAGATAAAAT 
GAGAATAATGATAACCAGAACTTGCTGGAAATGTGCCTACAATGGCCTTGTAACAGCCATTG 
TTCCCAACAGCATCACTTAGGGGT6TGAAAAGAAGTATTTTTGAACCTGTTGTCTGGTTTTG 
AAAAACAATTATCTTGTTTTGCAAATTGTGGAATGATGTAAGCAAATGCTTTTGGTTACTGG 
TACATGTGTTTTTTCCTAGCTGACCTTTTATATTGCTAAATCTGAAATAAAATAACTTTCCT 
TCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50919 
><sub\mit 1 of 1, 472 aa, 1 stop 
xMW: 53847, pi: 5.75, NX{S/T): 2 

MSNIYIQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRWPGFI 
VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADELN 
NKHT I FGKVTGDTVYNMLRLS EVDI DDDERPHNPHKI KSCEVLFNPFDD 1 1 PRE I KRLKKEK 
PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDPHLSSVPW 
ESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSANVKSAGEGEVEKKSV 
SRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 
QQSKKGTSREDQTIjy^LNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDSDTFE I YDPRNPVNKRRREESKKLMREKKERR 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 109-112 and 201-204 

Cyclophilin-type peptidyl -prolyl cis-trans isomerase signature. 

amino acids 49-66 

Homologous region to Cyclophilin-type peptidyl -prolyl cis-trans 
isomerase 

amino acids 96-140, 49-89 and 22-51 



FIGURE 90 



CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG 
CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 
AGCCCTGGCCAGACATGTCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 
GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 
CCCTTCTGTGGGGCTCAATTTTGGAAATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 
CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 
GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTCACCAAATATGGAAC 
CCTGCAAGGAAAACAGATGCATGTGGGGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 
TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 
GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 
TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 
CTCAGCCTCTGAGTGTCTGGGGCTACAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 
ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCT 
GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 
CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 
GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 
CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 
AGGAGAACATCGCAGCCTTCGGGGGAGACCCAGGAAATGTGACCCTGTTCGGCCAGTCGGCG 
GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 
CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 
CCAAGAAGGTTGCCCACCTGGCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 
CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 
GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 
TGATCCCAGATGACCCTTTGGTGCTCCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 
CTAGGTGTCAACAACCTGGAATTCAATTGGCTCTTGCCTTAT.AATATCACCAAGGAGCAGGT 
ACCACTTGTGGTGGAGGAGTACCTGGACAATGTCAATGAGCATGACTGGAAGATGCTACGAA 
ACCGTATGATGGACATAGTTCAAGATGCCACTTTCGTGTATGCCACACTGCAGACTGCTCAC 
TACCACCGAGAAACCCCAATGATGGGAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 
AAGTACCTGCAGCTGGATTTTACCACAAGAGTGGGCATGAAGCTCAAGGAGAAGAAGATGGC 
TTTTTGGATGAGTCTGTACCAGTCTCAAAGACCTGAGAAGCAGAGGCAATTCTAAGGGTGGC 
TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 
TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 
GCCTCCAGGCCAAAGCTAGAGCTTTTGCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 
TGACATCCCATGATGCCCCTCTACTTCACTGTTGACATCCAGTTAGGCCAGGCCCTGTCAAC 

CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 
CCCTGTCACTGCACCCAGCTTGGCATTTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 
TCACATTGGCCTGGAGGCCTAGGGCAGGTTGTGACATGGAGCAAACTTTTGGTAGTTTGGGA 
TCTTCTCTCCCACCCACACTTATCTCCCCCAGGGCCACTCCAAAGTCTATACACAGGGGTGG 
TCTCTTCAATAAAGAAGTGTTGATTAGAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44179 
<subunit 1 of 1, 545 aa, 1 stop 
<MW: 58934, pi: 9-45, NX(S/T) : 4 

MSTGFSFGSGTLGSTTVAAGGTSTGGVFSFGTGTSSNPSVGLNFGNLGSTSTPATTSAPSSG 

FGTGLFGSKPATGFTLGGTNTGJUjHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 

LGILRFAPPEPPEPWKGIRDATTYPPGWSLALSPGWSAVARSRLTATSASRVQASLLPQPLS 

VWGYRCLQESWGQLASMYVSTRERYKWLRFSEDCLYLNVYAPARAPGDPQLPVMVWFPGGAF 

IVGAASSYEGSDLAAREKWLVFLQHRLGIFGFLSTDDSHARGNWGLLDQMAALRWVQENIA 

AFGGDPGIWTLFGQSAGAMSISGLMMSPLASGLFHRAISQSGTALFRLFITSNPLKVAKKVA 

HLAGCNHNSTQILVNCLRALSGTKVMRVSNKMRFLQLNFQRDPEEIIWSMSPVVDGWIPDD 

PLVLLTQGKVSSVPYLLGVNNLEFNWLLPYNITKEQVPLVVEEYLDlSrVNEHDWKM 

IVQDATFVYATLQTAHYHRETPMMGICPAGHATTRMKSTCSWILPQEWA 

Important features: 
Signal peptide: 

amino acids 1-29 

Carboxylesterases type-B serine active site. 

amino acids 312-327 

Carboxylesterases type-B signature 2. 

amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468 
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GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATGCTACTGCCACTGCT 
GCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 
AGTCAGTGATGGTGCCGGAGGGCCTGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 
CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 
CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 
GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 
ATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 
CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 
ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGAAAGGGTGTGAGCGCACAGAGGACC 
GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 
GCCAGCCCTGGAGCCCCAGCCCCAGGGAAATGTCCCATACCTGGAAGCCCAAAAAGGCCAGT 
TCCTGCGGCTCCTCTGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAG 
AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 
GGTGAAGGCTGGGGATTCAGGGCGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGC 
AGCGAGCCCTGGACCTCTCTGTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAA 
GCAAACAGGACAGTCCTGGAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCA 
AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 
GGGGACAGGTTCTGAGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTT 
CAAGTGGAGCACGAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGT 
CTCTCTCAGCCTCTCCGTGCACTATAAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAG 
CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 
ATTCTACCGAAGAGACGGACTCAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCAC 
GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGA 
AAGCCACACCAAACAGTCCTCGGACCCCTCCTCCACCAGGTGCTCCCTCCCCAGAATCAAAG 
AAGAACCAGAAAAAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 
CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCA 
GACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 
CA ATGAG GGTCTCTTAGQCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 
GTTGAAGATAACAGAGTGCAAAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 
CTCTCTTTCTCTCTCTTTTAAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATC 
CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 
GCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCAGG 
CGCCTGTAATCCTACCTACTTGGGAAGCTGAGGCAGGAGAATCACTTGAACCTGGGAGACGG 
AGGTTGCAGTGAGCCAAGATCACACCATTGCACGCCAGCCTGGGCAACAAAGCGAGACTCCA 
TCTCAAAAAAAAAATCCTCCAAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 
AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 
ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 
CAGACATTTAAACCAGAGCAACTCCATCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 
TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 
TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 
ATCCCACCAAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTCCT 
GACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACCCGATATGTCCCA 
AAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCA2\ATAAGCAAGAAATAACCATAAAA 
GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 
TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA54002 
xsubunit 1 of 1, 544 aa, 1 stop 
xMW: 60268, pi: 9.53, NX(S/T): 3 

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGIiCISVPCSFSYPRQDWTGSTPAYGYWFK 
AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 
YVTYNFMmDGFFLKVTVLSFTPRPQDHNTDLTCHVDFSRKGVSAQRTVRLRVAYAPRDLVIS 
ISRDNTPALEPQPQGNVPYLEAQKGQFLRLLCAADSQPPATLSWVLQNRVLSSSHPWGPRPL 
GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 
PVLEGQSLCIiVCVTHSSPPARLSWTQRGQVLSPSQPSDPGVLELPRVQVEHEGEFTCHARHP 
LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCLALIIMKILPKRRTQTETPRPR 
FSRHSTILDYINWPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFPEP 
KSSTQAPESQESQEELHYATLNFPGVRPRPEARMPKGTQADYAEVKFQ 

Important features : 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 399-418 

N-glycosylatlon site. 

amino acids 100-103, 297-300 and 306-309 

Immunoglobulins and major histocompatibility complex proteins 
signature. 

amino acids 365-371 
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TGAAGAGTAATAGTTGGAATCAAAAGAGTCAACGCA ATGA ACTGTTATTTACTGCTGCGTTT 
TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 
AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 
TTTGTACCAGAGGAAATGAATACGACTAGTCATCACATCGGCCAGCTAAGATCTGATTTAGA 
CAATGGAAACAATTCTTTCCAGTACAAGCTTTTGGGAGCTGGAGCTGGAAGTACTTTTATCA 
TTGATGAAAGAACAGGTGACATATATGCCATACAGAAGCTTGATAGAGAGGAGCGATCCCTC 
TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 
GTTTGTCATCAAAGTTTCGGATATCAATGACAATGAACCAAAATTCCTAGATGAACCTTATG 
AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 
GCTGACGATCCCTCAAGTGGTAATAATGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 
ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 
TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 
TCTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAATAAGCCTATATTTAA 
AGAAAGTTTATACCGCTTGACTGTCTCTGAATCTGCACCCACTGGGACTTCTATAGGAACAA 
TCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTACAGCATTGAAGAGGAT 
GATTCGCAAACATTTGACATTATTACTAATCATGAAACTCAAGAAGGAATAGTTATATTAAA 
AAAGAAAGTGGATTTTGAGCACCAGAACCACTACGGTATTAGAGCAAAAGTTAAAAACCATC 
ATGTTCCTGAGCAGCTCATGAAGTACCACACTGAGGCTTCCACCACTTTCATTAAGATCCAG 
GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 
AGAAACCCCACAGGGATCATTTGTAGGCGTGGTGTCTGCCACAGACCCAGACAATAGGAAAT 
CTCCTATCAGGTATTCTATTACTAGGAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 
ACTACAAGTAACTCACTGGATCGTGAAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 
AGAAAAATACAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 
ATGATCATGCTCCTGAGTTCTCTCAATACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 
GGTCAGGTAATTCAGACTATCAGTGCAGTGGATAGAGATGAATCCATAGAAGAGCACCATTT 
TTACTTTAZ^TCTATCTGTAGAAGACACTAACAATTCAAGTTTTACAATCATAGATAATCAAG 
ATAACACAGCTGTCATTTTGACTAATAGAACTGGTTTTAACCTTCAAGAAGAACCTGTCTTC 
TACATCTCCATCTTAATTGCCGACAATGGAATCCCGTCACTTACAAGTACAAACACCCTTAC 
CATCCATGTCTGTGACTGTG6TGACAGTGGGAGCACACAGACCTGCCAGTACCAGGAGCTTG 
TGCTTTCCATGGGATTCAAGACAGAAGTTATCATTGCTATTCTCATTTGCATTATGATCATA 
TTTGGGTTTATTTTTTTGACTTTGGGTTTAAAACAACGGAGAAAACAGATTCTATTTCCTGA 
GAAAAGTGAAGATTTCAGAGAGAATATATTCCAATATGATGATGAAGGGGGTGGAGAAGAA6 
ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 
CGGAAAACCACAAGCGCTGAGATCAGGAGCCTATACAGGCAGTCTTTGCAAGTTGGCCCCGA 
CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 
CCCCTCCTTTTGATTCCCTCCAGACCTACGCTTTTGAGGGAACAGGGTCATTAGCTGGATCC 
CTGAGCTCCTTAGAATCAGCAGTCTCTGATCAGGATGAAAGCTATGATTACCTTAATGAGTT 
GGGACCTCGCTTTAAAAGATTAGCATGCATGTTTGGTTCTGCAGTGCAGTCAAATAAT TAGG 
GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 
AGAGTTTTGTGCCCTGGCTCTATGGCGGGGAAAGCCCTAGTCTATGGAGTTTTCTGATTTCC 
CTGGAGTAAATACTCCATGGTTATTTTAAGCTACCTACATGCTGTCATTGAACAGAGATGTG 
GGGAGAAATGTAAACAATCAGCTCACAGGCATCAATACAACCAGATTTGAAGTAAAATAATG 
TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 
CATTATTTACTTAGGAAAGAGTAAAAATACCAAACGAGAAAATTTAAAGGAGCAAAAATTTG 
CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 
ATTGAAAATGTATAGTCAGAGAAATTTTCATGAATTATTCCATGAAGTATTGTTTCCTTTAT 
TTAAA 



FIGURE 95 



X /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 3 9 0 6 
xsubxinit 1 of 1, 772 aa, 1 stop 
xMW: 87002, pi: 4.64, NX(S/T): 8 

MNCYLLLRFMLGIPLLWPCLGATENSQTKKVKQPVRSHLRVKRGP?VWNQFFVPEEMNTTSHH 
IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 
GRAVEPESEFVIKVSDINDNEPKFLDEPYEAIVPEMSPEGTLVIQVTASDADDPSSGNNARL 
LYS LLQGQP YES VEPTT6VI RI S S KMDRELQDE YWI IQAKDMIGQPGALSGTTSVLI KLSD 
VHDNKPIFKESLYRLTVSESAPTGTSIGTIMAYDNDIGENAEMDYSIEEDDSQTFDIITNHE 
TQEGIVILKKKVDFEHQNHYGIRAKVKNHHVPEQLMKYHTEASTTFIKIQVEDVDEPPLFLL 
PYYVFEVFEETPQGSFVGWSATDPDNRKSPIRYSITRSKVFNINDNGTITTSNSLDREISA 
WYNLS ITATEKYNI EQ I SS I PLYVQVLNINDHAPEFSQYYETYVCENAGSGQVIQTI SAVDR 
DES lEEHHFYFNLSVEDTNNS S FTI IDNQDNTAVILTNRTGFNLQEEPVFYI SILIADNGI P 
SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVIIAILICIMIIFGFIFLTLGLKQ 
RRKQILFPEKSEDFRENIFQYDDEGGGEEDTEAFDIAELRSSTIMRERKTRKTTSAEIRSLY 
RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDQD 
ESYDYLNELGPRFKRLACMFGSAVQSNN 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino . acids 597-617 

N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 

Cadherins extracellular repeated domain signature. 

amino acids 136-146 and 244-254 



FIGURE 96 



ATTTCAAGGCCAGCCATATTTTTNTGTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTTGTNTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAACATTTGACATTATT 



FIGURE 97 



GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 
CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 
CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCTGCCA 
CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCATGGCCAACGCGGGGCTGCAGCTGTTGGGC 
TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 
GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACGAGGGGCTGTGGA 
TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 
CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 
AGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGC 
AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 
GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 
CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 
TTCTGGGAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 
AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGTGTGACACAGAGGCAAAAG 
GAGAAAATCATGTTGAAACAAACCGAAAATGGACATTGAGATACTATCATTAACATTAGGAC 
CTTAGAATTTTGGGTATTGTAATCTGAAGTATGGTATTACAAAACAAACAAACAAACAAAAA 
ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 
ATATAGGAGGGAAGATTTTTCCATTTGTATTACTGCTTCCCATTGAGTAATCATACTCAAAT 
GGGGGAAGGGGTGCTCCTTAAATATATATAGATATGTATATATACATGTTTTTCTATTAAAA 
ATAGACAGTAAAATACTATTCTCATTATGTTGATACTAGCATACTTAAAATATCTCTAAAAT 
AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 
TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 
CCACAAGACCTAGCCTAATTTACCAAGGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 
CATATACTTATTTTATTTTTTACCATAATCTTATAGCACTTGCATCGTTATTAAGCCCTTAT 
TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 
GTTTCTAAAGCCAAGAAGAATTTATTACAAATCAGAACTTTGGAGGCAAATCTTTCTGCATG 
ACCAAAGTGATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 
CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 
AACACAACTTTATTGATTGAATTTTTAAGCTACTTATTCATAGTTTTATATCCCCCTAAACT 
ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 
TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 
ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 
TTCTATCTGCCAAATTGAGATAATGATACTTAACCAGTTAGAAGAGGTAGTGTGAATATTAA 
TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 
CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 
CACTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACACACATACCTTCAT 
GTGGTTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACGCACATAC 
CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 
GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 
CTATTTCACTTGAGCAAGATGATGTAATGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 
GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 
GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 
TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 
ACTAAGGGAAAGAATTGAGGAATTAACTGCATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 
AAATAAAAAAAATGTTAAG 



FIGURE 98 



></usr/ seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA52185 
xsiibunit 1 of 1, 211 aa, 1 stop 
xMW: 22744, pi: 8.51, NX(S/T): 1 

MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTGQI 
QCKVFDSLLNLSSTLQATRALMWGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGA 
IFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGALLCCSC 
PRKTTS YPTPRPYPKPAPS SGKDYV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site. 

amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 
amino acids 70-111 

ABC-2 type transport system integral membrane protein 

amino acids 119-133 



FIGURE 99 



TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 
GCCATCNTCACACTGCCCTTCCCCAGTGGAGGATTTTACTCCCTATGCTGGCGACAACATCG 
TGACCGCCCAGCCCATGTACGAGGGGCTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 
ATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCCGTGC 
CTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 
AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 
GCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAAN 
CNTTCAACANTTCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 
GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 
GTTCCTGTCCC 



FIGURE 100 



ACCCTTGACCCAACGCGGCCCCCCGACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTTGG 
GCTTCATTCTCCCCTTCCTGGGATGGACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCNACAACATCGTGACCGCCCAGGCCNTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNNNTGAAGTGTATGAAGTGCTTGGAAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTTTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCGA 



FIGURE 101 



GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CATCNTCAGACTCCCTGCCCCATGGAGATTTNNCCTATGCTGGCGACAACATCNTGACCCCC 
AGCCATGTACGAGGGGCTTTGAACGTCNGCGTGTCGCAGANCACCGGGCAGATCCAGTGCAA 
AGTCTTTGACTCCTTGCTGAATCTGNGCAGCACATTGCAGCAACCCNTGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
GCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTT 
CTTGCAGGTCTGGCTATTTNNNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 



FIGURE 102 



ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 
TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 
GCNTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 
CAGCACATTGCAAGCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 
CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 
ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 
CACAGCATGGTATGGCAATAGANTlSnsrTTCNNGE^ 

CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 
GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 



FIGURE 103 



AGAGCACCGGCAGATCCCAGTNCAAAGTCTTTGACCCTTGCTGAATCTGAGCAGCACATTNC 
AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 
GCCACCGTTGGCATGAAGTNTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 
GGTATGGCAATAGNATNNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 
GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAA 



FIGURE 104 



AGCAATGCCCTGCCCCCAGTGGAGGATTAATTCCTATGNTGGGGACAACATTGTGACNGCCC 
AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 
AAAGTNTTTGACTCCTTGCTGAATTTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGT 
GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 
AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 
CTTNTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAATNGTTCAAGA 
ATTTTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 
GCTGGGCTGCTGCTTNTTTCTGCCTTNTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 



FIGURE 105 



TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 
TGGCAATAGAATCGTTCT^GAATTNTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 
TTGGTCAGGCTCTNTTCACTGGNTGGGCTGCTGCTTCTNTNNGCCTTNTGGGAGGTGCCCTA 
CTTTGCTGTTCCTG 



FIGURE 106 

TTCCTGGGATGGATCCGCCCCCATCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 
TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 
CCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 
TGCAAGCAACCNTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGG 
CCACCGTTGGCATGAAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGNNGCCACAGCAT 
GGTATGGCAATCAGACCCNNTCANAAACTCTATGACCCTATGACCCCAGTCAATGCCAGGTA 
CGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTG 
CCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCACG 



FIGURE 107 

CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 
CTGCCCTGCCCCATGGAGGATTTACTCNTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 
TGTACGAGGGGCTGTGGATGTCNGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 
TTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCNTGCCTTGATGGTGGTTGGCA 
TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 
GAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGGTCTGGCTATTTNTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTAT 
GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 



FIGURE 108 



GCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCAC 
CGCGTAGACCGACCCCCCCCTCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCG 
ACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGT 
GACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAAC ATGG CGAAGGTGGAG 
CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGGTCCCTTCACCGATGTTGT 
CACCACCAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTA 
CAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATT 
AATGTATCTGTGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTT 
TATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTATGGAAGGAGG 
CAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAAT 
GATAAACCACATGATGTAGAAATAAATAAAATTATATCCACAACTGCATCAAAGACAGAAAC 
ACCAATAGTGTCTAAGTCTCTGAGTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGG 
AAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG 
GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGC 
CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 
TCGTTGGTGTAATTATTGGGAAGATTGCCTT GTAGA GGTAGCATGCACAGGATGGTAAATTQ 
GATTGGTGGATCCACCATATCATGGGATTTAAATTTATCATAACCATGTGTAAAAAGAAATT 
AATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCTGCACACACATACAC 
AGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTG 
ATTGAGGGGGAAAAAGAATGATCTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAAT 
GGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTC 
TCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCAT 
GCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTG 
CTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGA 
AGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGT 
TGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAA 
GCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATA 
TTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGC 
TGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCT 
GTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCA 
GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTG 
ACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATA 
CCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCC 
TTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGC 
ACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTAC 
TAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCAACCCAGTTCTGTTTGA 
CTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTAAGAAAA 



FIGURE 109 

< / us r / s eqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 3977 
<subunit 1 of 1, 243 aa, 1 stop 
<m: 27228, pi: 7.43, NX(S/T): 2 

MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIID 
AGASIIWSVMLQPFDYDPNEKSKHKPMVQSMFAPTDTSDMEAWKEAKPEDIJVIDSKLRCVFE 
LPAENDKPHDVEINKI I STTASKTETP I VSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREE 
NKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALWLFFIVGVIIGKIAL 

Important features: 
Transmembrane domain: 

amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-myristoylation site. 

amino acids 59-64, 64-69 and 235-240 



FIGURE 110 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG 
GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 
CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGAAA 
CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 
AAAGACNAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 
CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGAAGGTTAACTTTAAAATGAGC 



FIGURE 111 



TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 
GGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTT 
CCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCC 
CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA 
CTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAA6CT 
AAATTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 
ACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCG 
TGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTG 



FIGURE 112 



CCCTGGTGGTTTTGTTCTTTAATTCGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 

GNATGCACCNGGCTGGTAAATTGGATTGGTGGATCCACCATATCCATGGGATTTAAATTTAT 

CATAACCATGTGTAAAAAGAAATTAATGTATGATGACATNTCACAGGTATTGCCTTTAAATT 

ACCCATCCCTGNANACACATACACAGATACACANANACAAATNTAATGTAACGATKPTTT^ 

AAAGTTAAAAATGTATAGTAAC 



FIGURE 113 



GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTCC 
TTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATNAGACGTAC 
TTGTNATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
TTATTCAGAGATGTTTAATGCATATTTAANTTATTTAATGTATTTNATNTCATGTTTTCTTA 
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC 
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 114 



TGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 
AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG 
TGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 
CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 
ATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAA 
TGCTGCGTGC 



FIGURE 115 



AAACCTTTAAAAGTTGAGGGGAAAAGAATGATCCTTTATTAATGACAAGGGAAACCNTGNGT 
AATGCCACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGC 
TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 
GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 
GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 
ACAGAGTCAGAAGCCCAAAGGAATTGCANTGTGGCAGCATCAGANGTANTNGTCATAAGTGA 
GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 
AAAGGGWCCAAGNTAAATTTGTATTGGTTCATGTAGTGAAGTCAl^TGTTATTCAGAGATG 
TTTAAT6CATATTTAANTTATTTAATGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 
ACAGTTAATGCTGCGTGCTGCTGAANTCTGTTGGGTGAANTGGTATTGCTG 



FIGURE 116 



GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
TCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACG 
TACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGC 
AGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 



FIGURE 117 



GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGCTCTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA 
CTACAGGACAAAATTAGAAGATCAAAATGGAAAATATGCTGCTTTGGTTGATATTTTTCACC 
CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AACCAGGGTGAAAGTTCAAGATTTGGTTCTTGAGCCGACTCAAAATATCACCACAAAGGGAG 
TATCTGTTAGGAGAAAGAGACAGGTGTATGGCACCGACAGCAGGTTCAGCATCTTGGACAAA 
AGGTTCTTAACCAATTTCCCTTTCAGCACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
TCAAAGGGAGTAAAAAGCTAAGGGTAGGGTTGTTGAAGATGAGGAATAAAAGTGGAGGCAAG 
AAACGTCGAGGTTCTAAGAGGAGCAGGAGAGAAGCTAGTGGTGGTGACCAAAGAGAGGGTAC 
CAGAGAGCATCTGCAGGAGAGAGCGAAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC 
AGAGGATTGCCGAAGGGAGGCCTTCCTTTCAGTGGACCCGGGTCAAGAATACCCACATTCCG 
AAGGGCTGGGCACGAGGAGGCATGGGGGACGCTACCTTGGACTATGACTATGCTCTTCTGGA 
GCTGAAGCGTGCTCACAAAAAGAAATACATGGAACTTGGAATCAGCCCAACGATCAAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCTAAAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAACGATGCCAATTGTGCTTACGGCTAACAGAGACCTGAAACAGGGCGGTGTATCATCTAAA 
TCACAGAGAAAACCAGCTCTGCTTACCGTAGTGAGATCACTTCATAGGTTATGCCTGGACTT 
GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATCAGGAGATTTTCGTCCATTTAAAAAA 
TGTATAGGTGCAGATATTGAAACTAGGTGGGCACTTCAATGCCAAGTATATACTCTTCTTTA 
CATGGTGATGAGTTTCATTTGTAGAAAAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAACCTTCAAACAGGTATTATAAATAACATGTGACTCCTTAATGGACTTATTCTCAGGGTCC 
TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
GGTAGATGGTAAAGCAATTAGTATCAGAATAGAGACAGAAAGTTACAACACAGTTTGTACTA 

ggSStttagtt^^'^'^"^^^^'^^^'^^'^ 

CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
TGGGAGAAATAGTTTGTTCTATGAAATAAACCTAGTTTAGAAATAGGGAAGCTGAGACATTT 
TAAGATCTCAAGTTTTTATTTAACTAATACTCAAAATATGGACTTTTCATGTATGCATAGGG 
AAGACACTTCACAAATTATGAATGATCATGTGTTGAAAGCCACATTATTTTATGCTATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAAAATGAATAAAATTTATGAATATGA 



FIGURE 118 

</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57253 
<subunit 1 of 1, 413 aa, 1 stop 
<MW: 47070, pi: 9.92, NX(S/T): 3 

MENMLLWLIFFTPGWTLIDGSEMEWDFMWHLRKVPRIVSERTFHLTSPAFEADAKMMVNTVC 
GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVKYQDLVLEPTQNITTKGVSVRRKRQV 
YGTDSRFS I LDKRFLTNFPFSTAViCLSTGCSG IL IS PQHVLTAAHCVHDGKDYVKGSKKLRV 
GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 
FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGGMIHFS 
GFDJTORADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGVYLRLKDPDKKNWiCRKIIAVYSG 
HQWVDVHGVQKDYNVAVRITPLKYAQICLWIHGNDANCAYG 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachment site. 

amino acids 236-239 

Serine proteases, trypsin family, histidine active site, 
amino acids 165-170 
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AATGTGAGAGGGGCTGATGGAAGCTGATAGGCAGGACTGGAGTGTTAGCACCAGTACTGGAT 
GTGACAGCAGGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGCAAGG 
ATCCAAGCATGGAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC 
CTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 
GGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 
GCCTGAGCAGCAAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 
TGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCATAATGATGTCAAGCA 
CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 
AGTGCCAAGCCAAAGGAACAACCCTGGTTGTTGAACTAGCACCTAAGGTCTTAGATGGTACG 
CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 
TCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 
CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT 
GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT 
ATATCTGGAAACCAAAACCCTCCAGGGGACTAAAGGTGAAAACAGTCTCAGCTCCACAGGAA 
CTTTCCTTGTGGACAATTCTAGTGTGGACTTCCAGAAATTTCCAGACAAAGAGATACTGAGA 
ATGGCTGGACCACTCACAGCAGATTTCATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 
TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 
CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 
AGCAACCGTGTGGTTGCTGACCAATACTGTCACTATTACCCAGAGAACATCAAACCCAAACC 
CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 
CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 
TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 
TGTCACTTCAGTGGAAGAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCT 
GCAACATTTTTGACTGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT 
GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 
CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 
AACCCAAAGAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 
GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCGTAAGTTGTAAAAGCACAGACTGTTCTATA 
TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGA 
ACTAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58847 
<subunit 1 of 1, 525 aa, 1 stop 
<MW: 58416, pi: 6.62, NX(S/T) : 1 

MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASYSLRRCLS 
SKSCEGRNIRYRTCSNVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCSLKCQ 
AKGTTLWELAPKVLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR 
LVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL 
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS 
ATCGGGYQLTSAECYDLRSNRWADQYCHYYPENIKPKPKLQECNLDPCPASDGYKQIMPYD 
LYHPLPRWEATPWTACSSSCGGGIQSRAVSCVEEDIQGHVTSVEEWKCMYTPKMPIAQPCNI 
FDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK 
EKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features : 
Signal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 251-254 

Thrombospondin 1 

amino acids 385-399 

von Willebrand factor type C domain proteins 

amino acids 385-399, 445-459 and 42-56 
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CGGACGCGTGGGCGGCGGCTGCGGAACTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 
AGATGGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG 
GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 
GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 
AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 
GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 
CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 
CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 
ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 
TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 
GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 
AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT 
TACCAGGGAACTAGCCCGCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 
GTATTGTACGGACAAATCTGGGGAGGCACATACACATTCCACTGTTGGTCAAACCACTCTTC 
AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCAGACTTCCATTTATTT 
GGCCTCTTCACCTGAG6TAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTA2^GAGGAAG 
AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 
ATGGTTGGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG 
TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 
ATATTGGAATAGCCTGCTAAGAGGTACATGTGGGTATTTTGGAGTTACTGAAAAATTATTTT 
TGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGTAAGTATAATGAATAATAA 
GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 
ATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTTTTCAAGTATCTTTGAGTT 
TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC 
CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 



FIGURE 122 

</usr/segdb2/sst/DNA/Dnaseqs .min/ss .DNA58747 
<subunit 1 of 1, 336 aa, 1 stop 
<MW: 36865, pi: 9.15, NX(S/T) : 2 

MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLMHGKTVLITGANSGLGRATAAE 
LLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDLASLRSVRA 
FCQEMLQEEPRLDVLINNAGIFQCPYMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSR 
IVWSSKLYKYGDINFDDLNSEQSYNKSFCYSRSKIJ^ILFTREIARRLEGTIWTVNVLHPG 
IVRTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEE 
LLPKAMDESVARKLWDISEVMVGLLK 

Important features: 
Signal peptide: 

amino acids 1-21 

Short -chain alcohol dehydrogenase family protein 

amino acids 134-144, 44-56 and 239-248 



N-glycosylation site. 

amino acids 212-215 and 239-242 
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GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 
TGGGATATCAGTGAAGTGATGGTTNGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTT 
TATAAAACTGCATATCAGTTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACT 
TGAAGAAAAAGAATTTTGATATTGGAATAGCCTGNTAAGAGGNACATGTGGGTATTTTGGAG 
TTACTGAAAAATTATTTTTGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGT 
AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 
AGCATGGATGACATATTAATATTT6TCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT 
TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 
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GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC 
CTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 
CCCAGCGTTACCATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT 
GCTCCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 
ATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 
CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 
TCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCC 
AGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAAT6GGATGATGATGAAG 
AGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGCAACAAAAAAG 
TGACCCCATTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA 
ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG 
AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 
ATATAGTGGCGACAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACT 
TGGGAGCTATGACAAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 
GTCCGAGAAATAACATTTGAAAATGGAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCAT 
ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 
AATTAATAAGTGAAAAAGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 
CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 
GCATATGTATGTGTTTGGAGACTTCAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 
TATTTGACTTACATTCTGGAAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGAT 
ACAGCCCCAGGAGAGCAAGCCCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAA 
ACTAGCACCCAGTGAATATAGGTATACTCTATTGAGGGATCGAGATGAGCTTTAAAAACTTG 
AAAAACAGTTTGTAAGCCTTTCAACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTA 
TATTTTCATAATTCTATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57689 
<siibunit 1 of 1, 406 aa, 1 stop 
<MW: 46927, pi: 5.21, NX{S/T) : 0 

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQ 
MLHPIFEEASDVIKEEFPNENQWFARVDCDQHSDIAQRYRISKYPTLKLFKNGMMMKREYR 
GQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNyRVFERVANILH 
DDCAFLSAFGDVSKPERYSGDNI I YKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVRE I 
TFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLH 
I QKTPADCPVIAIDSFRHMYVFGDFKDVLI PGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG 
EQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulimi targeting sequence. 

amino acids 403-406 

Tyrosine kinase phosphorylation site- 

amino acids 203-211 

Thioredoxin family proteins 

amino acids 50-66 



m 
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ATTAAGGAAGAATTTCCAAATGAAAATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 
CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATG 
GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 



FIGURE 127 



AGAGGCCTCTCTGGAAGTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACNAGG 
TGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCAACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGANCCCAGCGTNACC 
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTTTACTCCTGTAACAACTGAAATAACmJGTCTTGATACNNAGAATATAGATGAAA 
TTTTAAACNATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA 
TCAAGTAGTGTTTGCCA6AGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATACAGGA 
TAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATACAGG 
GGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGC 



FIGURE 128 



GCCCACGCGTCCGATGGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT 
TACAAGAATCCTATAGACCAGTGTAATACCCTGAATCCCCTTGTACTCCCAGAGTACCTCAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
TGGTGAGCTCTTAGAACAACACACAGAAGAATTGGTCCAGTTAAGTGCATGCAAAAAGCCAC 
CAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGCCTGTGGAATCTGATCAGT 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTTCATATGTTATACTCAGATAAAGATTTTAAATGGTATTACGTATAAATTAATATAAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAAACAGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACATTTACAAAAATAAAAAGCGGGAATTTTCCCTTCGCTT 
GAATATTATCCCTGTATATTGCATGAATGAGAGATTTCCCATATTTCCATCAGAGTAATAAA 
TATACTTGCTTTAATTCTTAAGCATAAGTAAACATGATATAAAAATATATGCTGAATTACTT 
GTGAAGAATGCATTTAAAGCTATTTTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAa.CTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 
GCAATACAATAAAACTCTGAAATTAAGACTC 
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< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA2 3330 
<subunit 1 of 1, 144 aa, 1 stop 
<MW: 16699, pi: 5.60, NX(S/T): 0 

MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 

CSmFLCAAEWLTLGLlSnyiPLLAYHimYMSRPVMSGPGLYDPTTim 

FYLLAFFYYLYGMIYVLVSS 

Important features: 
Signal peptide: 

amino acids 1-20 

Type II transmembrane domain: 

amino acids 11-31 

Other transmembrane domain: 

amino acids 57-77 and 123-143 
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ATTATAGCATTTGATGAGCTGAAGACTGATTACAAGATCCTATAGACCAGTGTAATACCCTG 
AATCCCCTTGTACTCCCA6AGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC 
AGCAGAGTGGCTTACACTGGGTCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGGATGGT6CAAATTAGCTTTTTATCTTCTAGCATTTTT 
TTACTACCTATATGGCATGATCTATGTTTTGGTGAGCTCTTAGAACAACACACAGAAGAATT 
GGTCCAGTTAAGTGCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGT 
CCAAGAGTAGCCTGTGGAATCTGATCAGTTACTTTAAAAAATG 
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CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG 
GAACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGAC 
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC 
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC 
CTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACC 
TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC 
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA 
GCAAAATGTCCTATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGA 
GAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT 
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC 
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG 
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA 
AGCAGGGCCTCTACCTACAAAAGTGAATCTTGCTCATTCTGAAA TTTAA GCATTTTTCTTTT 
AAAAGACAAGTGTAATAGACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCA 
TTGGATATAGGCCTTAAGAAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 
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</usr/seqdb2/sst/DNA/Dnasegs .min/ss .DNA26847 
<siibunit 1 of 1, 323 aa, 1 stop 
<MW: 36223, pi: 5.06, NX(S/T): 1 

MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYP 
KEEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFA 
ELRQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEI 
QYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVL 
SVMVLLWICCATVATAVEQYVPSEKLSIYGDLEFmEQKIJSrRYPASSLWVRSKTEDHEEAG 
PLPTKVNLAHSEI 

Important features: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 
amino acids 241-260 



N-glycosylation site. 

amino acids 90-93 
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TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 
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CACACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGCAACAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC 
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
GGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCC 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 
CCGC 
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GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 
GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 
GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCA ATGG GCCGCGGCTG 
GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 
AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 
ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 
ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 
GGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 
GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 
AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 
AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 
ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 
CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC 
AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 
TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAftAAGAGCATTCTACAGACTTATATCTGG 
CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 
AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 
GGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 
ATCCAAAGTGTTACCATTCTTCGA6CGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 
AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 
CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 
CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 
GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 
AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 
AATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT 
TCAGGAACTTGTTACAGAATATTCATTAAAGAAAACAAGCTGATATGTGCCTGTTTCTGGAC 
AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 
ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 
AAAAATTGTGTTi^GTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 
GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 136 



< /us r / seqdb2 / s s t /DNA/Dna seqs . min/ s s . DNA5 3 9 7 4 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pi: 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYIJDDCTCDVETIDRFNNYRLF 
PRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA 
NNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPE 
RYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPIASGQGTSEENTFYSWLEGLCVEKRAFY 
RLISGLHASINVHLSARYLLQETWLEKPCWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI 
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAH 
KLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH 
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site. 

amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class -V pyridoxal -phosphate 

amino acids 216-222 

Interleukin-7 proteins 

amino acids 33 8-343 



FIGURE 137 



GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAA?IGACCTT 
TAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA 
GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 
TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 
ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAG2^GG 
CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 
CTTNGAGCGCCCAGATTTTCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAA 
TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 



FIGURE 13 8 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 
AGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAGAAAC 
CGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG 
TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 
CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 
GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 
GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 
ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 
ATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCCATAAAAGAAAAAAGAGAA 
GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 
AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 
GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 
CGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATATGTGCCCGTGTGGATGCTTCATTCCA 
GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 
CCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCGGTATGATGCCTGGCAAAGGG 
CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 
TTCAGACTCCTGGCCAGGTGTGGTGGCCCACACCTGTAATTCTAGCACTTTGGGAGGCCAAG 
GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 
CTCTACTAAAAAAAAAAAAATACAAAAATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 
ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 
AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 
AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 
CTCATGCCTGTAATCCCAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG 
GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 



FIGURE 139 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57039 
xsubunit 1 of 1, 124 aa, 1 stop 
xMW: 13352, pi: 5.99, NX(S/T): 1 

MELPFVTHLFLPLVFLTGLCSPPNLDEHHPRLPPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS 

Important features : 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 

amino acids 67-81 



FIGURE 140 



CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 

GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 

TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 

CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 

ATTCCCAGGGCCACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGAC 

AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 

TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 

CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 
CAGATGGTGATGG 



FIGURE 141 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAi^AAA 



FIGURE 142 

< /us r / seqdb2 / s s t /DNA/Dnas egs . min/ s s . DNA5 7033 
<STibunit 1 of 1, 311 aa, 1 stop 
<MW: 35076, pi: 5.04, NX(S/T): 2 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPFl^STILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILV 
WPLFVWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins. 

amino acids 92-119 



Integrins alpha chain proteins 

amino acids 232-262 



FIGURE 143 



TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 



li'muiiiiiiiii 



FIGURE 144 



CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 
GGCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 
GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGA6CTGGAAAGGAGAGAGGGAGGAGGAG 
GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 
GGAGAGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 
GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 
AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 
GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 
GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 
CAGGGGTCCCTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 
GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 
ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 
CTGGTGAATGCAGCGTG6AGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 
GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 
AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 
GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 
GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 
AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 
CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 
CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 
TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 
TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 
TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 
TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 
AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 
TGGTGTCCCCCATGGTCGCTGAGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 
CCCACAAGGCGAGGGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAGAATACTTA 



FIGURE 145 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA34353 
<s\ibun.it 1 of 1/ 328 aa, 1 stop 
<MW: 36238, pi: 9.90, NX(S/T): 3 

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLWAAWSLC 
AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPWNVSGGPL 
LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 
FVNVASTSNPFLSRLLNKDTITRISYKNDAYFLQDLSLELLPPESFGFITYQGSLSTPPCSE 
TVTWILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

Important features: 
Signal peptides 

amino acids 1-23 

Transmembrane domain: 

amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 



FIGURE 146 



GGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCCAGCCTCCGCCGCCGAGCCTC 
GTTCGTGTCCCCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGGGGCGCCCACCCTGGCAGACTi^CGAA 
GOlGCTCCCTTCCCaCCCCyiACTGCAGGTCTi^TTTTGQACGCTTTGCCTGCCATTTCTTCCAGGTTGAGGGAGC 
CGCAGAGGCGGAGGCTCGCGTATTCCTGCAGTCAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCAGGCCCTTC 
GCGAGCGGGGCTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCGGTTTGGCT 
CACCTCTCCCAGGAaACTTCACACTGGAGAGCCaAAAGGAGTGGAAGAGCCTGTCTTGGAGATTTTCCTGGGGAA 
ATCCTGAGGTCa.TTCa.TTATGA^GTGTACCGCGCGGQAGTGGCTCAGAGTA&CCarAG'rGrTGTTrATGGr'TaGa 
GCAATTCCAGCCATGGTGGTTCCCAATGCCACTTTATTGGAGAAACTTTTGQaAAAATACATGGATGAGGATGGT 
GAGTGGTGGATAGCCAAACaACGAGGGAAAAGGGCCATa^CAGaCAATGACATGC^ 

AATAAATTACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGQATGTAGAGCTGGAAAC^ 

TCTGCAGAATCCTGGGCTGAAAGTTGCTTGTGGGAACATGGACCTGCAAGCTTGCTTCCATCAATTGGACAGaAT 

TTGGGAGCACACTGGGGAAGATATAGGCCCCCQACGTTTCATGTACAATCGTGGTATGATGAAGTGAAAGACTTT 

AGCTACCCATATQAACaTGAATGCaACCCM'ATTGTC(::aTTCAGGTGTTCTGGCCCTGTATGTA<^ 

CAGGTCGTGTGGGCAACTAGTAAOVGAATCGGTTGTGCCATTAATTTGTGTCATAAaVTGAACATCTGGGGGCAG 

ATATGGCCCaaAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCCCCTTACAAA 

CATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCTGTAGAGAAaATCTGTGCTACAAAGAAGGG 

TCAGACAGGTATTATCCCCCTCGAGAAGAGGAAACyiAATGAaATAGAACGACAGaiGTCACAAGTCC^^ 

CATGTCCGGACAAGATCAGATGATAGTAGCTkGAAATGAAGTCATAAGCGCACAGCAAATGTCCCAAATTC 

TGTGJ^GTAAGATTAAGAGATCAGTGCAAR.GQAACaACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGAT 

AGTAAAGCTAAAGTTATTGGCaVGTGTACATTATGAAATGCS^TCCyiGCATCTGTAGAGCTGCAATTCATT^^ 

ATAATAGACAATGATGGTGGCTGGGTAGATATCACTAGACAAGGAAGAAAGCATTATTTCATCaAGTCC^ 

AATGGTATTCAAACAATTGGOW^TATCRGTCTGCTAATTCCTTa^CAGTCTCTAAAGTAACAGTTC^ 

ACTTGTGAAACAACTGTGGAACAGCTCTGTCCATTTCATAAGCCTGCTTCACATTGCCCAAGAGTATACTGTCCT 

CGTAACTGTATGCaAGCAAATCCACATTATGCTCGTGTAATTGGi^CTCGAGTTTATTCTGATCTGTCCAGTATC 

TGCAGAGCAGCAGTACATGCTGQAGTGGTTCGAAATCACGGTGGTTATGTTGATGTAATGCCTGTGGACyiAAAGA 

AAGACCTACATTGCTTCTTTTCAGAATGGAATCTTCTCAGAAAGTTTACAGAATCCTCCAGGAGGAAAGGCATTC 

AGAGTGTTTGCTGTTGTGTGAAACTGAATACTTGGAAGAGGACCATAAAGACTATTCCAAATGCAATATTTCTGA 

ATTTTGTATAAAACTGTAACATTACTGTACAGAGTACaTCAACTATTTTCyiGCCCAAAAAGGTGCCAAATGC^ 

TAAATCTTGATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATATAATGG 

TTTTAGAAATCCTGTGTTAAATATTGCTATATTTTCTTAGCAGTTATTTCTACAGTTAATTACATAGTCaTGATT 

GTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTAATA2!^TGAATCTAAACATTGaATGTGAATG 

GCCCTCAGRAAATCATCTAGTGCaTTTAAAAATAATCGACTCTAAAACTGAAAGAAACCTTATCACATTTTCCCC 

AGTTCaATGCTATGCCATTACCaACTCCA?U^TAATCTCAAATAATTTTCCACTTAATAACTGTAAAGTTTTTTTC 

TGTTAATTTAGGCATATAGAATATTAAATTCTGATATTGCACTTCTTATTTTATATAAAATAATCCTTTAATATC 

CAAATGAATCTGTTAAAATGTTTGATTCCTTGGGAATGGCCTTAAAAATAAATGTAATAAAGTCAGAGTGGTGGT 

ATGAAAACATTCCTAGTGATCATGTAGTAAATGTAGGGTTAAGCATGGACAGCCAGAGCTTTCTATGTACTGTTA 

AAATTGAGGTCACATATTTTCTTTTGTATCCTGGCAAATACTCCTGCAGGCCAGGAAGTATAATAGCAAAAAG 

GAACaAAGATGAACTAATGTATTAO^TTACCATTGCCACTGATTTTTTTTAAATGGTAAATGACCTTGTATATAA 

AATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTACATATTAAATTTTTTCTGCTGGTGGA 
TAAACATTAAAATTAATCATGTTTCAAAAAAAAAAAAA 



FIGURE 147 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKR^ 

QSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHW 

GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 

HNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 

YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR 

YECPAGCLDSKAKVIGSVHYEMQSS ICRAAIHYGI IDNDGGWVDITRQGRKHYFIKSNRNGI 

QTIGKYQSANSFTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 

TRVYSDLSSICRAAVHAGWRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV 
FAW 

Important features: 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 

amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 

amino acids 28-31 



FIGURE 148 



GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 
CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 
GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCATGTCCCTGCTCCCACGCCG 
CGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 
ACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 
AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 
CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 
AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 
GAATAGGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 
CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTC 
TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 
TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 
CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 
AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 
GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 
TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 
GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 
CTGCATCIGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 
GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 
CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 
TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 
TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAA 
AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 
TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 
GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 
CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 
TGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 
AAAAJW^AAAA 



FIGURE 149 



MSLLPRRAPPVSmLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKBVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 

Signal sequence: 

amino acids 1-34 



FIGURE 150 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCGGTCTGCGTCATGTTiU^CTCCaATGTCCTCCTGTG 

CTTiy^CTGCTCTTGCCATCAAGTTCACCCTCATTGACAGCC^^GCaCAGTATCCAGTTGTCftAC^ 

CAAi^TCCGGGGCCTi^GAACACCGTTACCCAATGAGATCTTGGGTCCS^GTGGAGCAGTACTTAGGGGTCCCC^^ 

TGCCTCACCCCCCACTGGAGAGAGGCGGTTTCAGCCCCCAGAACCCCCGTCCTCCTGGACTGGCATCCGAAATAC 

TACTCAGTTTGCTGCTGTGTGCCCCCAGCACCTGGATQAGAGATCCTTACTGCATGACATGCTGCCCATCTGGTT 

TACCGCCAATTTGGATACTTTGATGACCTATGTTCAAGATCAAAATGAAGACTGCCTTTACTTAAACATCTACGT 

GCCCACGGAAGATGQAGCCAACACy^GAAAAACGCAGATGATATAACGAGTAATGACCGTGGTGAAGACGAAGA 

TATTCATGATCAGAACAGTAAGAAGCCCGTCATGGTCTATATCCATGGGGGATCTTACATGGAGGGCACCGGCAA 

CATGATTGACGGCAGCATTTTGGCaAGCTACGGAAACGTa^TCGTGATO^CCATTAACTACCGTCTGGGAATACT 

AGGGTTTTTAAGTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGATTCAAGCACTGCGGTG 

GATTGAGGAGAATGTGGGAGCCTTTGGCG6GGACCCCAAGAGAGTGACCATCTTTGGCTCGGGGGCTGGGGCCTC 

CTGTGTCAGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTCTTCCAGAAGGCCATCATTCAGAGCGGCACCGC 

CCTGTCCAGCTGGGCAGTGAACTACCAGCCGGCCAAGTACACTCGGATATTGGCAGACAAGGTCGGCTGCAAC^^ 

GCTGGACACCACGGACATGGTAGAATGCCTGCGQAACAAGAACTACAAGGAGCTCATCCaGCAGACC^^ 

GGCCACCTACCACATAGCCTTCGGGCCGGTGATCGACGGCGACGTCATCCCAGACGACCCCCAGATCCTQAT^^ 

GCAAGGCGAGTTCCTCaACTACGACATCATGCTGGGCGTCAACCAAGGGGAAGGCCTGAAGTTCGTGGACGGCA^ 

CGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGACTTCTCCGTGTCCAACTTCGTGGACAACCTTTACGG 

CTACCCTGAAGGGAAAGACACTTTGCGGGAGACTATCAAGTTCaTGTACACAGACTGGGCCGATAAGQAAAACCC 

GGAGACGCGGCGGAAAACCCTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCT 

GCACGCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAAAGCGAAATQAAGCCCAGCTG 

GGCAGATTCGGCCCATGGTGAT6AGGTCCCCTATGTCTTCGGCATCCCCATGATCGGTCCCACCGAGCTCTTCAG 

TTGTAACTTTTCCAAGAACGACGTCATGCTCAGCGCCGTGGTCATGACCTACTGGACGAACTTCGCCAAAACTGG 

TGATCCAAATCAACCAGTTCCTCAGGATACCAAGTTCATTCACACAAAACCCAACCGCTTTGAAGAAGTGGCCTG 

GTCCAAGTATAATCCCAAAGACCAGCTCTATCTGa^TATTGGCTTGAAACCCAGAGTGAGAGATCACTACCGGGC 

AACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTCATTTGCACAACTTGaACGAGATATTCCAGTATGTTTCAAC 

AACCACaAAGGTTCCTCCACCAGACATGACATCATTTCCCTATGGCACCCGGCGATCTCCCGCCAAGATATG^^ 

AACCACa^CGCCCAGCAATCACTCCTGCCAACAATCCCaAACACTCTAAGGACCCTCACAAAACAGGGC^^ 

GGACACAACTGTCCTCaTTGAAACCAAACGAGATTATTCCACCGAATTAAGTGTCACCATTGCCGTCGGGGCGTC 

GCTCCTCTTCCTCmACATCTTAGCTTTTGCGGCGCTGTACTACaAAAAGGACAAQAGGCGCCATGAGACTCACAG 

GCGCCCCAGTCCCCJkGAGAAACACa^CAAATQATATCGCTCACATCCAGAACGAAGAGATCATGTCTCTGCAGAT 

GAAGCAGCTGOAACACGATCACGAGTGTGAGTCGCTGCAGGCACACGACACACTSAGGCTCaCCTGCCCGCCAGA 

CTACACCOTCACGCTGCGCCGGTCGCCAGATGACATCCCACTTATGACGCCAAAaiCCaTCACCATGAT^^ 

OkCACTGACGGGGATGCAGCCTTTGCACACTTTTAACACCTTCAGTGGAGGACAAAACAGTACAAATTTACCCC^ 

CGGACATTCCACCACTAGAGTATAGCTTTGCCCTATTTCCCTTCCTATCCCTCTGCCCTACCCGCTCAGCAACAT 

AGAAGAGGQAAGGAAAGAGAGAAGGAAAQAGAGAGAGAAAQAAAGTCTCCAGACCaGGAATGTTTTTGTCCCACT 

GACTTAAGACAAAAATGCAAAAAGGOIGTCATCCCATCCCGGCAGACCCTTATCGTTGGTGTTTTCCAGTATT^^ 

AAQATCAACTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCA 

CTCO^TCAATGTTTAGTGTGATAGGACATCACCATTTCAAGGCCCCGGGTGTTTCCAACGTCATa 

GACACTTCTGAAACTCAGCCAAGGAGACTTGATATTTTTTAATTACAATGGAAGTTTAAACATTTCTTTCTGTGC 

CACACAATGGATGGCTCTCCTTi^GTGAAGAAAGAGTCAATGAGATTTTGCCCAGCACATGGAGCTGTAATCCAG 

AGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGGACTGTGCAGCGAAATCTGTACGGTTCTGTGCAAAGAG 

GTGTTTTGCCAGCCTGAACTATATTTAAGAGACTTTGT 



innpiBrrrraTTTi 



FIGURE 151 



MLNSNVLLWLTALAIKFTLIDSQAQYPWNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP 
PTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 
EDCLYmiYVPTEDGJ^TKKNADDITSlSmRGEDEDIHDQNSKKPVMVYIHGGSYMEGTGNMI 
DGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKR 
VTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSSWAVNYQPAKYTRILADKVGCNML 
DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGV 
NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 
ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 
FGI PMI GPTEIiFSCNFSKNDVMLSAWMTYWTNFAKTGDPNQPVPQDTKF IHTKPNRFEEVA 
WSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS 
FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 
ASLLFLNIIAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 
SLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN 
LPHGHSTTRV 

Signal sequence; 

amino acids 1-24 

Transmembrane domains: 

amino acids 189-204, 675-692 



FIGURE 152 

GGGAAAGATGGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT 
CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 
CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 
CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 
TGACCCAGTATATCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGG 
GTGCCATGTTTCCTGAGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAA 
GAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 
CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 
GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 
CAGCTATGATCATGAGCGGGATGGGCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 
GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 
ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 
CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGG6ATCTCTCAGATAATCATGATGTCA 
TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 
GATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 
CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 
TAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAAAGCGCTTCTACTGA 
GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 
GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 
TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 
CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 
TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 
AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 
AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 
TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 
TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 
GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 
TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 
GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 
AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 
GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 
GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 
TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 

TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 
CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 
TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCA6 
AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 
TCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 



FIGURE 153 



>< /us r / seqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA5 0 9 1 1 
xsubunit 1 of .1, 348 aa, 1 stop 
xMW: 39711, pi: 8.70, NX(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 
GVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKN 
LHGDGIiAIWYTKDRMQPGPVFGN^IDKFVGLGVFVDTYPNEEKQQERVFPYISAMVNNGSLSY 
DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRG 
YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 
GLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY 

Signal sequence: 

amino acids 1-3 8 



Transmembrane domain: 

amino acids 310-329 



FIGURE 154 



CCGAGCCGGGCGCGai.GCGACGGAGCTGGGGCCGGCCTG(3GACCaTGGGCGTGAGTGaUiTCTACGGA.TCAGTCT 

CTGATGGTGGGTCGTTAACCTCAGTGGGGACTCCAAGATTTCCATGAAGAAAATCaGTTGTCTTCATTCAAGaA^ 

TGGGGTCTGGCTCAGAATTCCTGCAGCTGGTGAAiyiTCTGTTTTCTAGAAGAGGTTTAATTAATGCCTGCAGTCT 

GACaTGTTCCCGATTTGAGGTGAAACCATGAAGAGAaAATAGAATACTTAATAATCCTTTTCCGC^ 

TGCTGCTGCTGGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGGTGT 

CGACTCCTAAGAATGGAAT6AGTAGCAAQAGTCGAAAGAGAATCATGCCCGACCCTGTGACGGAGCCCCCTGTGA 

CAQACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCaGTGTGGCCQAGCGCAGCATGGAAGGTCATGCCC 

CGCATCaTTTTAAGCTGGTOTCSlGTGCATGTGTTCATTCGCCy^CGGAGACAGGTACCCACTGTATGTCATTCC 

AAACAAAGCGACCAGAAATTGACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCC?^AAACTGGAAGCTTTC^ 

TTAGTCACATGTCAAAAGGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACCCAAATCACC 

CATTGTGTGAGATGGGaGAGCTCACACAGaCAGQAGTTGTGCAGCATTTGCAGAACGGTCAGCTGCTGAGGGAT^ 

TCTATCTAAAGAAACACAAACTCCTGCCCAATGATTGGTCTGCAGACCAGCTCTATTTAGAGACCACTGGGAAAA 

GCCGGACCCTACAAAGTGGGCTGGCCTTGCTTTATGGCTTTCTCCCAGATTTTGACTGGAAGAAGATTTATTTCA 

GGCACCAGCCAAGTGCGCTGTTCTGCTCTGGAAGCTGCTATTGCCCGGTAAGAAACCAGTATCTGGAAAAGGAGC 

AGCGTCGTCAGTACCTCCTACGTTTGAAAAACyiGCCAGCTGGAGAAGACOTACGGGGAQATGGCCAAGATCGTGG 

ATGTCCCCa^CCAAGCAGCTTAGAGCTGCCAACCCCATAGACTCCATGCTCTGCCACTTCTGCCACAATG^ 

TTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTTCaAGGTAATTAAGACCCATCAGATCGAGGaTGA^ 

GGGAAAGACGGGAGAAGAAATTGTACTTCGGGTATTCTCTCCTGGGTGCCCACCCCATCCTGAACCaAACCATCG 

GCCGGATGCAGCGTGCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGT 

CACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCAGGTTCCCAAGGTTTGCAGCCAGGTTGATCTTTGAGCT^ 

GGCAAGACAGAGAAAAGCCCAGTGAACATTCCGTCCGGATTCTTTACAATGGCGTCGATGTCACATTCCACACCT 

CTTTCTGCCaAGACCaCCACAAGCGTTCTCCCAAGCCCATGTGCCCGCTTGAAAACTTGGTCCGCTTTGTGAAA^ 

GGGACATGTTTGTAGCCCTGGGTGGCAGTGGTACAAATTATTATGATGCATGTCACAGGGAAGGATTCTAAAAGG 

TATGC:AGTAC:AGCAGTATAGAATCCa.TGCCAATACU^GAGCATAGGGAAAGGTCCACTTCTAGTTTTGTCTGTTAC 

TAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGGTTGGGGTTGAACAGT 

AAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACAAAATGGCCAGTTCaCAGAGGAATAGAAGGTACTT 

TATCATAGCCAGACTTCGCTTAGAATGCCAGAATAATATAGTTCAAGACCTGAAGTTGCCAATCCAAGTTTGCAC 

TCTTCTGGCCTGCCCCATGTTACTATGTGATGGAACCaGCACaCCTCAACCTUSJ^TTTTTTTAATCTTAGACA 

TTTACCTTGTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTTTTCTGTAAAGG 

GCCAGATTGTAAATATTTO^GACTGTGTGGACCaAAAGGCCaLCATACAGTCTCTGTCATAACTACTO^ 

TTCTGAAGCAGGAAAGCCACCACaGACAGTACSiTAAAGGAATATGTGTAGCTGGGTTCCCAGGCCAGACAAAACA 

GATGGTGACCAGACTTGGCCCCTGGGCTGTAGTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTACAATTGC 

ACTTCCAGCACTTTQAGAACGAGTTGAATACCAAGAATTATTCAATGGTTCCTCCAGTAACTTCTGCTAGAAACA 

CAGAATTTGGTCTGTATCTGACACTAGAACaAAACTTGAGGGTAAATAAACa^TTGAATTAGaATGAATCATA^ 

AACTGATTAGAAGAATACTTGATGTTTATGATQATTGTGGTACAAGATAGTTTTAAGTATGTTCTAAATATTTGT 

CTGCTGTAGTCTATTTGCTGTATATGCTGAAATTTTTGTATGCCATTTAGTATTTTTATAGTTTAGGAAAATATT 

TTCTAAGACCAGTTTTAGATGACTCTTATTCCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAG 

GAGGCTAGAAGATGAATTCAGGCACTTTCTTCCS^TAAAACTAATTATGGCTCATTCCCTTTGACAAGCTGTAGA 

ACTGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAATGCGTTTTTGGA 

AGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGGTGTTTTATATATTAQAAGCAATTATAATTACATCTG 

TGATTTCTGAACTAATGGTGCTAATTCAGAGAAATGGA?^GTGAAAGTGAGATTCTCTGTTGTCATCGGCATTCC 

AACTTTTTCTCTTTGTTTTTGTCCAGTGTTGCATTTGAATATGTCTGTTTCTATAAATAAATTTTTTAAGAATAA 



FIGURE 155 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48329 
xstibunit 1 of 1, 480 aa, 1 stop 
xMW: 55240, pi: 9.30, NX(S/T): 2 

MLFKNRFLLLLALAALLAFVSLSLQFFHL I PVSTPKNGMS SKSRKRIMPDPVTEPPVTDPVY 
EALLYCNIPSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLYVI PKTKRPE IDCTLVANRKP 
YHPKLEAFISHMSKGSGASFESPLNSLPLYPNHPLCEMGEIiTQTGWQHLQNGQLLRDIYLK 
KHKLLPNDWSADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 
VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGRKEELF 
ALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSP 
CQDHHKRSPKPMCPLENLVRFVKRDMFVALGGSGTNYYDACHREGF 



Signal sequence: 

amino acids 1-18 



FIGURE 156 

AAAAAAGCrcaCTAAAGTTTCTATTAGAGCGAATACGGTAGATTTCCATCCCCTTTTGAAGAACAGTACTGTGGA 

GCTATTTAAGAGATAAAAACGAAATATCCTTTCTGGGAGTTCAAGATTGTGCAGTAATTGGTTAGGACTCTGAGC 

GCCGCTGTTCACCAATCGGGGAGAGaAAAGCGGAGATCCTGCTCGCCTTGCACGCGCCTGAAGaiCS^GC^ 

AGCTAGGAATGAACCATCCCTGGGAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCAACTGTCCCATTCTAT 

GGGCGAAGGaACTGCTCCTGACTTCAGTGGTTAAGGGCAGAATTGAAAATAATTCTQGAGGAAGATAAGAATGAT 

TCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGACTCTGTGGGAGACCGG 

ATGCACCCAQATACGCTATTCAGTTCCG6AAGAGCTGGAGAAAGGCTCTAGGGTGGGCGACATCTCCAGGGACCT 

GGGGCTGGA6CCCCGGGAGCTCGCGGAGCGCGGAGTCCGCATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCT 

GAATCCGCGCAGCGGCAGCTTGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATG6GGGCCATCAAGTG 

TCAATTAAATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGGGACATTAA 

CGACaATGCGCCTTACTTTCGTGAAAGTGAATTAGAAATAAAAATTAGTGAAAATGCAGCCACTGAGATGCGGTT 

CCCTCTACCCCACGCCTGGGATCCGGATATCGGGAAGAACTCTCTGCAGAGCTACGAGCTCAGCCCGAACACTCA 

CTTCTCCCTCATCGTGCAAAATGGAGCCGACGGTAGTAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCG 

CGAAGAAAAGGCTGCTC».CCACCTGGTCCTTACGGCCTCCGACGGGGGCGACCCGGTGCGCACAGGCACCGCGCG 

CATCCGCGTGATGGTTCTGGATGCGAACGACAACGCACCAGCGTTTGCTCAGCCCGAGTACCGCGCGAGCGTTCC 

GGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTACCGACCCTGACGAAGGAGTCAATGCGGAAGT 

GAGGTATTCCTTCCGGTATGTGGACGACAAGGCGGCCCAAGTTTTCAAACTAGATTGTAATTCAGGGACAATATC 

AACAATAGGGGAGTTGGACCaCGAGGAGTOlGGATTCTACCAGATGGAAGTGCAAGCaATGGATAATGCAGGAT^ 

TTCTGCGCGAGCCAAAGTCCTGATCACTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCCTCACCTCTCT 

CGCCAGCTCGGTTCCCGAAAACTCTCCCAGAGGGACATTAATTGCCCTTTTAAATGTAAATGACCAAGATTCTGA 

GGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTAAATTAGAAAAATCTTACGGAAATTACTA 

TAGTTTAGTCACAGACATAGTCTTGGATAGGGAACAGGTTCCTAGCTACAACATCACAGTGACCGCCACTGACCG 

GGGAACCCCGCCCCTATCCACGGAAACTCATATCTCGCTGAACGTGGCAGACaCCAACQACaACCCGCCGGTCTT 

CCCTCAGGCCTCCTATTCCGCTTATATCCCAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCCCA 

CGACCCCGACTGTQAAGAGaACGCCCAGATOiCTTATTCCCTGGCTGAGAACACCATCa^GGGGC^ 

GTCCTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGAGCTCCTTCGACTACQAGCAGTTCCGAGA 

CTTGCAAGTGAAAGTGATGGCGCGGGACAACGGGCACCCGCCCCTCAGCAGCAACGTGTCGTTGAGCCTGTTCGT 

GCTGGACCAGAACGACAATGCGCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGQAGCT 

GGCTCCCCGCTCCGCAGAGCCCGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACAGAGACTCCGGCCAGaACGC 

CTGGCTGTCCTACCGTCTGCTCaAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCTGCa.CACGGGCGAGGTGCG 

CACGGCGCGAGCCCTGCTGGACAGAGACGCGCTCyiAGCAGAGCCTCGTAGTGGCCGTCCa^GQACCACGGCC^ 

CCCTCTCTCCGCCACTGTCACGCTCACCGTGGCCGTGGCCGACAGCATCCCCCAAGTCCTGGCGGACCTCGGCAG 

CCTCGAGTCTCCa.GCTAACTCTGAAACCTCAGACCTC:aCTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTG 

CGTCTTCCTGGCCTTCGTCATCTTGCTGCTGGCGCTCAGGCTGCGGCGCTGGCACAAGTCACGCCTGCTGCAGGC 

TTCAGGAGGCGGCTTGACAGGAGCGCCGGCGTCGCACTTTGTGGGCGTGGACGGGGTGCAGGCTTTCCTGCAGAC 

CTATTCCCACGAGGTTTCCCTCACCACGGACTCGCGGAAGAGTCACCTGATCTTCCCCCAGCCCAACTATGCAGA 

CATGCTCGTCAGCCAGGAQAGCTTTGAAAAAAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGA 

CAGTCATGGGTTAATTGAGGTGAGTTTATATCyUiATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCTCCCAAGC 

TGGAGTGCAGCGGTACGATCATAGCTCaLCTGCGGCCTCAAACTCCTAGGCTCAAGCaATTATCCCACCTTT^^ 

CCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTATCTATCTATCTATCTATCTATCTAT 

CTATCTATCTATCTATCTATTACTTTCTTGTACAGACGGGAGTCTCACGCCTGTAATCCCAGTACTTTGGGAGGC 

CGAGGCGGGTGGATCACCTGAGGTTGGGAGTTTGAGACCAGCCT6ACCAACATGGAQAAACCCCGTCTATACTAA 

AAAAATACAAAATTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGQAGAAT 

TGCTTTAACCTGGGAGGTGGAGGTTGCAATGAGCTGAGATTGTGCCATTGCACTCCAGCCTGGGCAACAAGA^ 



FIGURE 157 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48306 
xsubunit 1 of 1, 916 aa, 1 stop 
xMW: 100204, pi: 4.92^ NX(S/T): 4 

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAER 
GVRI I PR6RTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVEVR 
DINDNAPYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA 
DGSKYPELVLKRALDREEKAAHHLVLTASDGGDPWTGTARIRVMVIJDANDNAPAPAQPEYR 
ASVPENLALGTQLLWNATDPDEGVNAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE 
ESGFYQMEVQAlTONAGYSARAKVLITVLDViraNAPEWLTSLASSVPENSPRGTLIALLNVN 
DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST 
ETHISIiNVADTNDNPPVFPQASYSAYIPENNPRGVSLVSVTAHDPDCEENAQITYSLAENTI 
QGASLSSYVSINSDTGVLYALSSFDYEQFRDLQVKVMARDNGHPPLSSNVSLSLFVLDQNDN 
APEILYPALPTDGSTGVELAPRSAEPGYLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVG 
LHTGEWTARALLDRDALKQSLWAVQDHGQPPLSATVTLTVAVADSIPQVIxADLGSLESPA 
NSETSDLTLYLWAVAAVSCVFLAFVILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 
GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 
lEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 
lYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEVGSLRPA 

Signal sequence: 

amino acids 1-3 0 

Transmembrane domains : 

amino acids 693-711, 809-823, 869-888 



FIGURE 158 



CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCGCTGGATTCTGGAAGACCTCACC ATGG GACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCTGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 
CTGGTTC 
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FIGURE 159 

< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA4 8 3 3 € 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048, pi: 7.87, IIX(S/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPWQSIPHPCYNSSDVEDHNHDLMLL 
QLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site. 

amino acids 182-188 

Kringle domain proteins motif 

amino acids 205-217 



FIGURE 160 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 
CGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 
GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 
CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 
ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 
CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 
GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 
TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 
CCCGCCCAACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 
CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 
GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 
CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 
CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 
GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 
GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 
TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 
CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA 
CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 
CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 
CCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCA 
CTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTG 
TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 
GGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 
ACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACC 
GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 
ACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTG 
ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 
CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 
CCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCACGCAAGAACCGCACCCGCAGCCACT 
GCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCC 
CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 
TGGGCCCTGCTGACCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 
GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 
GTCCTCCCTGATGGACGCCTGCCGCCCGCCACCCCCATCTCCACCCCATCATGTTTACAGGG 
TTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCGGTATATAGAGATAT 
GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA 
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FIGURE 161 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44184 
xsubunit 1 of 1, 473 aa, 1 stop 
xMW: 50708, pi: 9.28, NX{S/T): 6 

MKRASAGGSRLIAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNR I S S VPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLIIDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features : 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 



FIGURE 162 

GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAATGGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGT 
GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 
ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 
ACTACGTTTCCTCTGAAGAATGCCGAAGACTTAGATGTAGGCCAAAACAATATTGAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 
CCTGGAT6TCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 
AACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 
CAATCCCTTGACAGGAGAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 
ATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT 
CAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AACTTTTACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 
CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 
CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 
GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCA6 
CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 
CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 
CTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 
GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 
CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 
AATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 
CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 
TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 
GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 
GGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 
AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT 
ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 
TAACTTTGGGTTCAATATTCAGTGACCATAGTTGACTTTTACATTCCATAGGTATTTTATTT 
TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 
TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 



FIGURE 163 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX(S/T): 5 

MEASGKLICRQRQVLFSFLLLGLSLAGAAEPRSYSVVEETEGSSFVTNLAKDLGLEQREFSR 
RGVRWSRGNKLHLQLNQETADLLLNEKLDREDLCGHTEPCVLRFQVLLESPFEFFQAELQV 
IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQISEDSPVGFLWKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDF 
EKLQSYEVNIEARDAGTFSGKCTVLIQVIDVNDHAPEVTMSAFTSPIPENAPETWALFSVS 
DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ 
LNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPALSSEALVRVWLDANDNS 
PFVLYPLQNGSAPCTELVPRAAEPGYLVTKWAVDGDSGQNAWLSYQLLKATELGLFGVWAH 
NGEVRTARLLSERDAAKHRLWLVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ 
ADLLTVYLWALASVSSLFLFSVLLFVAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT 
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain : 

amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP -binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 



FIGURE 164 



ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 
GCCGATTGCCTCTCGGCCTGGGCAATGGTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 
GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 
AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 
GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCAGGACAGGGCAGC 
AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 
TGATTCCTGGGGAAGCTGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 
GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 
AGCACACTTCCCTGACAGAGiyiGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 
CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 
GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 
TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 
GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 
CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 
TGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 
GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 
GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA 
CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 
CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGAGTAGTGATGGTCT 
GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 
CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA 
CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 
AAAAATATTCAATAG 



FIGURE 165 

>< /usr / seqdb2 / s s t /DNA/Dnas eqs . min/ s s . DNA4 8 3 3 3 
xsubunit 1 of 1, 360 aa, 1 stop 
xMW: 39885, pi: 4.79, NX(S/T) : 7 

MVPAAGRRPPRVMRLLGWWQVLLWVLGLPVRGVEVAEESGRLWSEEQPAHPLQVGAVYLGEE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC 
NVRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE 
NFTLKILmSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS 
QHSSLSTRFGTVAVPNILLFQGAKPMARFNHTDRTIiETLKIFIFNQTGIEAKKNVVVTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 321-340 

Homologous region to dilsufide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 

amino acids 211-227 



FIGURE 166 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCCCTGGCT 
CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG 
ATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC 
CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAA 
ATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG 
CGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 
TGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTA 
GTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA 
AAATCATGAATATTTTAA 



FIGURE 167 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50920 
xsubxmit 1 of 1, 295 aa, 1 stop 
xMW: 33518, pi: 7.74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN 
LLGHETMKEVLEQAGAWIPLVMKQCHPDTKPCFLCSLFAPVCLDDLDETIQPCHSLCVQVKDR 
CAPVMSAFGFPWPDMLECDRFPQDNDLC I PLAS SDHLLPATEEAPKVCE ACKNKNDDDND IM 
ETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKI^GVSERDLKKSVLWLKDSLQCTCE 
EMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 

Important features : 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 

amino acids 6-153 



FIGURE 168 



GTGGAGGCCGCCGACGATGGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 
CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 
AGCCTGCTCAACTGCTCCAACGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 
TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 
TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCG 
ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 
TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 
AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 
GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 
GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 
CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 
CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 
CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGCTTGGGGGGAATGATCGGGATCTCTG 
CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 
GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 
TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 
CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 
TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 
GTGTTCCTTCCCCGTGGC3GGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 
AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 
TCCACCTGCCAGCAGGGGGAGGATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 
CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 
AGGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 
GGTGTGGACCGAGGGGGAGCAGGiy^GGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 
GTGCACGGCGAGGGGGGCCTCGCTAGAGGACCCCAGAGGGCCCGGGAGCCCCCACCCAGCCT 
GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 
AGACTCGCAGGCAGGGTCCAAGCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 
CTCCTCCCCGTGGGTGATCACGTAGCTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA 
TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 
CCGGGAGCGAATTACAAGCGCGCACCTGAAAA 
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FIGURE 169 



></usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50988 
xsubunit 1 of 1, 560 aa^ 1 stop 
xMW: 58427, pi: 6.86, NX(S/T): 2 

MAGPTEAETGLAEPRALCAQRGHRTYARRWVFLLAI S LLNCSNATLWLSFAPVADVIAEDLV 

LSMEQINWLSLVYLWSTPFGVAAIWILDSVGLRAATILGAWLNFAGSVLRMVPCMWGTQN 

PFAFLMGGQSLCALAQSLVIFSPAKIjy^LWFPEHQRATAmLATMSNPLGVLVAlSrVLSP 

KKGEDIPLMLGVYTIPAGWCLLSTICLWESVPPTPPSAGAASSTSEKFLDGLKLQLMWNKA 

YVILAVCLGGMIGISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTK 

HFTEATKIGLCLFSLACVPFALVSQLQGQTLALAATCSLLGLFGFSVGPVAMELAVECSFPV 

GEGAATGMIFVLGQAEGILIMLAMTALTVRRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFF 

SCILAVFFHTPYRRLQAESGEPPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG 

ASLEDPRGPGSPHPACHRATPRAQGPAATDAPSRPGRLAGRVQASRFIDPAGSHSSFSSPWVIT 

Important features: 
Signal peptide: 

amino acids 1-44 

Transmembrane domains: 

amino acids 61-79, 98-112, 126-146, 169-182, 201-215, 248-268, 
280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation site. 

amino acids 40-43 and 43-46 



Glycosaminoglycan attachment site. 

amino acids 468-471 



FIGURE 170 



GTCCCy^CaTCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCaTTTGCTGAAGTG^ 

TAGTTCCCCT^GTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACaTCTCAGATCGCTTCCAATGaAGA 

TGGCCTTGCCTTGGGGTCOTGCTTGTTTCATAATCaTCTiiACTATGGGACAAGGTTGTGCCGGCAGCTCTGGGG^ 

AAGGAGOiCGGGGCTGATCAAGCCaVTCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACT 

TCTGAATCTAGCCCACTTGGCGGTAAGCM6ATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGGTGG 

CTACTTATTTCTTTTAGGGGATTGTCAGGAGGTGACCACTCTCaCGGTGAAATACCAAGTGTCAGAGGAAGTGCC 

ATCTGGTACAGTQATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTT 

CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCATTCAGGTGGACTCTGAGQAAGGCTTGCTCAGCACAGGCAGGCG 

GCTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 

GGCTCTGATCCaTGTGGAGATCCAAGTGCTGGACATCAATGACCACaVGCCACGGTTTCCCAAAGGCGAGC^^ 

GCTGGAAATCTCTGAQAGCGCCTCTCTGCGAACCCGQATCCCCCTGGACAGAGCTCTTGACCCAGACACAGGCCC 

TAACACCCTGCACACCTACACTCTGTCTCCCAGTGAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGAC 

CAAACATGCAGAACTCATAGTGGTGAAGGAGCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGC 

CTATGACAATGGGAACCCCCCCAAGTCAGGTACCaGCTTGGTCAAGGTCyiACGTCTTGGACrCCAAT^ 

CCCTGCGTTTGCTGAGAGTTCACTGGCACTGGAAATCCAAGAAGATGCTGCACCTGGTACGCTTCTCATAAAACT 

GACCGCCaCAGACCCTGACCAAGGCCCCSU^TGGGGAGGTGGAGTTCTTCCTCAGTAAGCACATGCCTCC^ 

GCTGGACACCTTCAGTATTGATGCOUlGACAGGCCAGGTCaTTCTGCGTCGACCTCTAGACTATGAAAAGAA 

TGCCTACGAGGTGQATGTTCAGGCAAGGGACCTGGGTCCa^TCCTATCCCAGCCCATTGCAAAGTTCTC^^ 

GGTTCTGGATGTCAATGACAACATCCCy^GCATCCyvCGTCaCATGGGCCTCCCAGCCATCACTG^^ 

TCTTCCCAAGGACaGTTTTATTGCTCTTGTCATGGCAGATGACrTGGATTCAGGACACAATGGTTTGGTCCaCT^ 

CTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCAACACATACATGTTGCTAACC^ 

CACACTGGACAGAGAGCAGTGGCCCAAATATACCCTCACTCTGTTAGCCCAAGACCyykGGACTCCa^ 

AGCCAAGAAACAGCTCaGCATTCAGATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGC^^ 

AGTCTCCACGCGGQAAAACAACTTACCCTCTCTTCaCCTCATTACCATCAAGGCTCATGATGCAGACTTGGGCAT 

TAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTCCAACACAGGAGA 

GGTCACTGCTCAQAGGTCACTGAACTATGAAGAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 

GCaACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGT 

CCa^GCCTGTGCTCAGCGATGGAAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 

CaTCGAGACTCCCaATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGGCCATT 

CCTTTTGACAACCATTGTGGOiAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGG 

AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGCTGTTCGTCAATGTCACCAATGCCAGCAGCCT 

CATTGGGAGTGAGTGGGAGCTGGAQATAGTAGTAQAGGACCAGGQAAGCCCCCCCTTACAGACCCGAGCCCTGTT 

GAGGGTCATGTTTGTCACCAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGAT 

GCTGACGGTQATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTQATCCTGGCTTTGTTCATGTCCATCTGCCG 

GACAGAAAAGAAGGACy^CaGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTACCGCCAGCAGCCCAAGAGGCC 

CCAGaAACACATTCAGAAGGCAGACATCCACCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGT 

CGGGCAGTCCCACAAAGATGTGGACAAGGAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTT 

GCTGCAAQACACGGTCaACCTCCTTTTC^CCATCCCAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCC 

CGAGCCCCAGCCTGCCACy^GGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGC 

TGGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCACCAGCCTCCTCTGCAACCCTGAGACGGCAGCGACATCT 

CAATGGCAAAGTGTCCCCTGAGAAAGAATCAGGGCCCCGTCAQATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGC 

TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGCT 

GTCCTTGCTGCATCaGGGCCAATTCCAGCCCAAACCAAACCACCGAGGAAATAAGTACTTGGCCAAGCCAGGA^ 

CAGCAGGAGTGOVATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGGAGGCCAGACAGACCCAGAA^ 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 

CAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCAC 

CACCAACTACCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGQACCTTCCAGACGTTCGG 

CAAGGCAGAGGCACa^GAGCTGAGCCCaACAGGCACGAGGCTGGCCAGCACCTTTGTCTCGGAQATGAGCTC^^ 

GCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 

CTGCGGGAGGACCCTCAGTTTAGACTTGGCCACCaGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGG 

AAAGACGGGGACTQAGGGCAAGAGCAGAGGCAGCy^GCAGCaGCAGCAGGTGCCTGTGAACATACCTCAGAC^^ 

CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAG 

CGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAGCCCCAGGACTAACAGCTGAC 

TGACCAAAGCAGCCCCTTGTAAGCAGCTCTGA6TCTTTTGGAGQACAGGGACGGTTTGTGGCTGAGATAAGTGTT 

TCCTGGCAAAACATATGTGGAGCACAAAGGGTCAGTCCTCTGGCAGAACAGATGCC^^ 

AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTAT 
CAATAAAGGAAAAGCAGTAAAAAAAAiiSAAAAAAAAAAA 



FIGURE 171 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX{S/T) : 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQEIiGREERRRQA 
GAAFQVLQLPQALPI QVDSEEGLLSTGRRLDREQLCRQWDPCLVS FDVLATGDLAL IHVE I Q 
VLDINDHQPRFPKGEQELEI SESASLRTRI PLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 
GPDETKHAELIWKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESS 
LALE I QEDAAPGTLLI KLTATDPDQGPNGEVEFFLSKHMPPEVLDTFS IDAKTGQVI LRRPL 
DYEKNPAYEVDVQARDLGPNP I PAHCKVLI KVLDVNDNI PSIHVTWASQPS LVSEALPKDS F 
lALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 
QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 
YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 
NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 
IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS 
PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK 
KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 
GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 
I LRSLVRLSVAAFAERNPVEELTVDS PPVQQI SQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 
RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 
PAWMARLSLPLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSL 
LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 

Important features: 
Signal peptide: 

amino acids 1-13 
Transmembrane domain: 
amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 



FIGURE 172 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 
CA6ACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 
GCAGGAGCCTTCCTTACACTTCGCC ATGA GTTTCCTCATCGACTCCAGCATCATGATTACCT 
CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 
GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 
GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 
GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 
TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 
ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 
AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 
ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 
TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 
AAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 
AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 
TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGGCACCAGAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGT 
GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGG2UICCAGGGCCTGACATTTTATAAAC 
AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 
CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 



FIGURE 173 



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEI 
LGVLNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFF 
WKLGDPFPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDI 
lALERRLLQTMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLI^ 
QEVDALEELSRQLFLETADLYATKERI E YSKTFKGKYFNFLGYFFS I YCVWKI FMAT INI VF 
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS 
SKSSNVIVLLLAQIMGMYFVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSA 
LSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 



FIGURE 174 



CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 



FIGURE 175 



GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAJ^TTTGGCCAAGGGTTTC 
TTTNTTGAA.TTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 
NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 
TATTGJ^TAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGGAGATCCCTTTCCCATTCTC 



FIGURE 176 



CTCGCGCAGGG&TCGTCCCATGGCCGGGGCTCGGAGCCGCGACCCTTGGGGQGCCTCCGGGATTTGCTACCTTTT 

TGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAATCTGGACGTGATGGGTGCCTTGCGCSlA 

GGAGGGCGAGC<^GGCAGCCTCTTCGGCTTCTCTGTGGCCCTGCACCGGCAGTTGCAGCCCC6ACCCCAGAGCTG 

GCTGCTGGTGGGTGCTCCCCAGGCCCTGGCTCTTCCTGGGCAGCaGGCGAATCGCACTGGAGGCCTCTTCGCTTG 

CCCGTTGAGCCTGGAGGAGACTGACTGCTACAGAGTGGACATCGACCAGGGAGOTGATATGCAAAAGGAAA^^ 

GGAGAACCAGTGGTTGGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCAAGATTGTTACCTGTGCACACCGATA 

TGAGGCaAGGCAGCGAGTGGACCAGATCCTGGAGACGCGGGATATGATTGGTCGCTGCTTTGTGCTCAGCCAGGA 

CCTGGCCATCCGGGATGAGTTGGATGGTGGGGAATGGAAGTTCTGTGAGGGACGCCCCCAAGGCCATGAACAATT 

TGGGTTCTGCCAGCAGGGCACAGCTGCCGCCTTCTCCCCTGATAGCCACTACCTCCTCTTTGGGGCCCCAGGAAC 

CTATAATTGGAAGGGCACGGCCAGGGTGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGACGACGG 

TCCCTACGAGGCGGGGGGAGAGAAGGAGCAGGACCCCCGCCTCATCCCGGTCCCTGCCAACAGCTACTTTGGCTT 

CTCTATTGACTCGGGGAl^GGTCTGGTGCGTGCAGAAGAGCTGAGCTTTGTGGCTGQAGCCCCCCGCGCCAACCA 

CAAGGGTGCTGTGGTCATCCTGCGCAAGGACAGCGCCAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCG 

CCTGACCTCCGGCTTTGGCTACTCACTGGCTGTGGCTGACCTCAACAGTGATGGCTGGCCAGACCTGATAGTGGG 

TGCCCCCTACTTCTTTGAGCGCCAAGAAGAGCTGGGGGGTGCTGTGTATGTGTACTTGAACCAGGGGGGTC3VCTG 

GGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGATCAGCCTGGCTGTCCTGGGGQA 

CCTa^CCAAGATGGCTTTCCAGATATTGCa.GTGGGTGCCCCCTTTGATGGTGATGGGAAAGTCTTCATCTACCa 

TGGGAGCAGCCTGGGGGTTGTCGCCAAACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCAAGAGCTTCGG 

CTACTCCCTGTCAGGCAGCTTGGATATGGATGGGAACCAATACCOTGACCTGCTGGTGGGCTCCCTGGCTGACAC 

CGCAGTGCTCTTCAGGGCCAGACCCATCCTCCATGTCTCCCATGAGGTCTCTATTGCTCCACGAAGCATCGACCT 

GGAGCAGCCCAACTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGGTCTGTTTCAGCTACATTGCAGTCCC 

CAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTGTTAGATGCGGACACAGACCGGAGGCTCCGGGGCCAGGT 

TCCCCGTGTGACGTTCCTGAGCCGTAACCTGGAAGAACCCa^GCACCaGGCCTCGGGC^CCGTGTGGCTGAAGC^ 

CCa.GCATGACCGAGTCTGTGGAGACGCCaTGTTCCAGCTCCAGGAAAATGTCAAAGACAAGCTTCGGGCC^ 

AGTGACCTTGTCCTACAGTCTCCAGACCCCTCGGCTCCGGCGACAGGCTCCTGGCCAGGGGCTGCCTCCAGTGGC 

CCCCATCCTCAATGCCCACCAGCCCAGa^CCCAGCGGGCAGAGATCCaCTTCCTGAAGCS^GGCT 

CaAGATCTGCCaGAGCAATCTGCaVGCTGGTCCACGCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTC 

TCTGCCa^TGGATGTGGATGGAACaACAGCCCTGTTTGCACTGAGTGGGCAGCCAGTC^TTGGCCTGGAGCTGAT 

GGTCSlCCAACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCC^TGAAGCCCAGCTCCTGGT 

OVTGCTTCCTGACTCACTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGZ^GCCACTCTGCCTGTCCAA 

TQAGAATGCCTCCCATGTTGAGTGTGAGCTGGGGAACCCCATGAAGAGAGGTGCCCAGGTCACCTTCTACCTC»l^ 

CCTTAGCACCTCCGGGATCAGCaTTGAGACCACGGAACTGGAGGTAGAGCTGCTGTTGGCCACGATCAGTGAGCA 

GGAGCTGCATCO^GTCTCTGCACGAGCCCGTGTCTTCATTGAGCTGCCACTGTCaiTTGCAGGAATGGCC^ 

CCaGCaACTCTTCTTCTCTGGTGTGGTGAGGGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGCAGCAAGGT 

O^GTATGAGGTCACGGTTTCCaACa^GGCCAGTCGCTCAGAACCCTGGGCTCTGCCTTCCTCaAC^ 

GCCTO^TGAGATTGCCAATGGGAAGTGGTTGCTGTACCCAATGCAGGTTGAGCTGCAGGGCGGGCAGGGGCCTGG 

GCAGAAAGGGCTTTGCTCTCCCaGGCCCAACATCCTCCACCTGGATGTGGACAGTAGGGATAGGAGGCGGCG^ 

GCTGGAGCOVCCTGAGCAGOVGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGC 

TGAQAAGAAQAAAAACATCACCCTGGACTGCGCCCGGGGCACGGCCAACTGTGTGGTGTTCAGCTGCCCACTCTA 

CAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCGTCTCTGGAACAGCACCTTTCTGGAGGAGTACTCAGC 

TGTGAAGTCCCTGGAAGTGATTGTCCGGGCCAACATCACAGTG2UVGTCCTCCATAAAGAACTTGATGCTCCGAGA 

TGCCTCCACAGTGATCCCAGTGATGGTATACTTGGACCCCATGGCTGTGGTGGCAGAAGGAGTGCCCTGGTGGGT 

CATCCTCCTGGCTGTACTGGCTGGGCTGCTGGTGCTAGCACTGCTGGTGCTGCTCCTGTGGAAGATGGGATTCTT 

CAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGGGAAGACCGAaV^^ 

GTTCAAGGAGGAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAGCCCCCG6CGGGAGGGCCCGGATGCACA 

CCCCATCCTGGCTQCTGACGGGCATCCCGAGCTGGGCCCCGATGGGCATCCAGGGCCAGGCACCGCCTAGGTTCC 

CATGTCCCAGCCTGGCCTGTGGCTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGT 

GGGCTGCTGGTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCyVGGGGCACAGACCTCTCCavCCCACAAG^ 

TCCTCCCACCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGACAGGGCCATGGGGTAGGG 

TGAGAAGGGCAGGGGTGTCCTGATGCA?^GGTGGGGAGAAGGGATCCTAATCCCTTCCTCTCCCATTCACCCTGT 

GTAACAGGACCCCAAGGACCTGCCTCCCCGGAAGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGA 

CTCAGGCTGCTCCTTCTCTAGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGT 

TTCGTCTATTTATTAAAAAATATTTGAGAACAAAAAA2^AAAAAAAAAAAAA 



FIGURE 177 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA55737 
xsubunit 1 of 1, 1141 aa, 1 stop 
xMW: 124671, pi: 5.82, NX(S/T) : 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 

QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGADMQKESKENQWL 

GVSVRSQGPGGKIVTCAHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 

RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADLAHLDDGPYEA 

GGEKEQDPRLIPVPANSYFGFSIDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLV 

PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGI 

SPLRLCGSPDSMFGISLAVLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLE 

GEAVGIKSFGYSLSGSLDMDGNQYPDLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQ 

PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 

HQASGTVWLKHQHDRVCGDAMFQLQENVKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAP 

I LNAHQPSTQRAE I HFLKQGCGEDKI CQSNLQLVHARFCTRVSDTEFQPL PMDVDGTTALFA 

LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 

ENASHVECELGNPMKRGAQVTFYL I LSTSGIS I ETTELEVELLLATI SEQELHPVSARARVF 

lELPLSIAGMAIPQQLFFSGWRGERAMQSERDVGSKVKYEVTVSNQGQSLRTLGSAFLNIM 

WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 

RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHVWGRLWNSTFLEEY 

SAVKSLEVI VRANITVKSS I KNLMLRDASTVI PVMVYLDPMAWAEGVPWWVI LLAVLAGLL 

VLALLVLLLWKMGFFKIiAKHPEATVPQYHAVKIPREDRQQFKEEKTGTILRISnSfW 

DAHPI LAADGHPELGPDGHPGPGTA 



Important features: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1040-1062 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 



FIGURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 

AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 

CACAACAAGATGCTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

TGGATGATAAACAATGGCTCACCACAATCTCTCAGTATGACAAGGAAGTCGGACAGTGGAAC 

AAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 

TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 

CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGGACATTGC 

CCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCT 

GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 

GTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATC 

TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 

GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 

CATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 

TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 

ATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 

TGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 

ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 

CATGATGTATACATTTj^TTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 

CAAAAATGATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 

CATATATTTTGTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 

AAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTA 

GTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

AAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 

ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 

AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 

GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 



FIGURE 179 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49829 
xsubunit 1 of 1, 436 aa, 1 stop 
xMW: 49429, pi: 4.80, NX(S/T): 0 

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKFR 
DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNPCKTKTLLRPERSRFDTSILPI 
CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKDSTEQCTKAFFNSCDTYKIDSLISNlSrEWCYCFQ 
RQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 246-267 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 



FIGURE 180 



CaGACTCCAGATTTCCCTGTCAACC:ACGAGGAGTCCAGAGAGG?^CGCGGAGCGGAGAG?^ 

CTCTTTCAGCCCGGGATCGCCCCAGCAGGGATGGGCGACAAGATCTGGCTGCCCTTCCCCGTGCTCCTTCTGGCC 

GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 

CCCGCCGGCCAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTA 

GATGGAGCAGGATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGAAAA 

TCAGATGGAGTTCa.CACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGAayiTACATTCAGCACCATT 

TCTGAGAAGGTGATTTTCTTTGaATTAATCCTGGATAATATGGQAGAACAGGCACAAGAACAAGAAGATTGGAAG 

AAATATATTACTGGCACAGATATATTGGATATGAAACTGGAAGACT^TCCTGGAATCaiTCAAa^GCATCAAGTC 

AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAAGC 

AACTTTGATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAAGTTTAT 

ATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAACTTAAAACTCCAAACTAGAGTACGTAACATTGAAA 

AATGAGGCATAAAAATGCAATAAACTGTTACyiGTCAAGACCATTAATGGTCTTCTCCAAAATATTTTGAGATATA 

AAAGTAGGAAACAGGTATAATTTTAATGTGAAAATTAAGTCTTCACTXTCTGTGCAAGTAATCCTGCTGATCCAG 

TTGTACTTAAGTGTGTAACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCAT 

TTTCCTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAATTGCAACACC 

AGTCTGTTTTTAAOVGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGTTACAAATTAACTGTGGAAGTTT 

TCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGATGGATTGAATAAATCTTTAGACTACAAAAGCCCAA 

CTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 

AGATTTTTATAACCAAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTC 

CCAAAAGCTGACmTTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTAGGACATGTTra 

AAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTTAACTAAAGTGTAGGA 

TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAA 

TAATOVTGTTATGTTAATTTTAACATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATA 

TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGGAA 

AGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTATAAATAAATATCTAGA 

ATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCAGTTAGCTGGAAG 

AAACTTTGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATA 

AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 

TAAAATGGCCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTTTTAAA 

CACCTAAATGTGAATAACCCATATATACAACAaAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGTCTTAC 

TCAAGTACTAGTAATTTAACTTCaTCATGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTGTTTAT 

GACTACATTGTGAGTTAGAAACAAACTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATT 

CTTGATGAGCAATAATGATAACCAQAGAGTGATTTCATTTACACTCATAGTAGTATAAAAAGAGATACATTTCCC 

TCTTAGGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAATGCCGTAT 

ATGATCAATTACCTTAATTGGCCAAQAAAATGCTTCAGGTGTCTAGGGGTATCCTCTGCAACACTTGCAGAACAA 

AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACA 

GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTCTA 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCT 

ACATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTACCATATAAAAACGATAATTGCTT 

TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAGACAGATATTTAAGAGG 

CATAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAAATCTTTAATATTTCTAAAGACATACTGTTTATCTGCTT 

CATATGCTTTTTTTAATTTCACTATTCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTTTATCACTT 

AACAGCTCATTTTGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCAACCGATTTC 

CATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCTCTGATATGCATTTGGATGATTAAT 

GTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTAATTTTATGGTAAAATTAATCCTTCTTA 

CACATAATGGTGTCTTAAAATTGACAAAAAATGAGCACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTAT 

GTGAAATTTTAAAAGACATTGATTCCGCATGTAAGGATTTTTCATCTQAAGTACaATAATGCACAATCAGTGTTG 

CTCAAACTGCTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATAATAA 

AAATTATCAAAGGAAAA 



FIGURE 181 

></usr/seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
xMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 
QVLDGAGLD IDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTI SEKVI FFEL 
ILDNMGEQAQEQEDWKKY I TGTDI LDMKLEDI LES INS I KSRLSKSGHIQI LLRAFEARDRJSr 
IQESNFDRVNFWSMVNLWMVWSAIQVYMLKSLPEDKRKSRT 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 



FIGURE 182 



CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGA6TCGCAGACACTATGCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGGCTCTGAGCCTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGAC TAGG GCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 183 

< / us r / seqdb2 / s s t /DNA/Dnaseqs . mi n/ s s . DNA5 6 9 6 5 
<siibunit 1 of 1, 175 aa, 1 stop 
<MV!: 19330, pi: 7.25, NX(S/T): 1 

MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature. 

amino acids 146-171 



FIGURE 184 



CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 
ACAGAGGAGTGGGCCGGGACCATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 
TGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 
CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTAC 
TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 
GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 
CTCCCACTCTTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 



FIGURE 185 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56405 
<subunit 1 of 1, 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX(S/T): 1 

MRGTRLALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCaSTTELCITVDGAPAmSLHCGALTL 

Important features: 
Signal peptide: 

amino acids 1-17 



N-glycosylation site. 

amino acids 46-49 



FIGURE 186 



CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGT6CGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAAAAAAAA 



FIGURE 187 

</usr/segdb2/sst/DNA/Dnaseqs.min/ss.DNA57530 
<sxibunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX(S/T): 1 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGI LYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQI CLACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRS SDCASGLCCARHFWS KI CKPVLKEGQVCTKHRRKGSHGLEI FQRCYCGEGLSCRIQ 
KDHHQASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 



FIGURE 188 



TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 




GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATQAT 
CATAGCACCTTGGATGGG 



FIGURE 189 



GAOSAACCTACCGGTACCGGCCGCGCGOTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTGCGCCGCGG 

CTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCA 

GGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAC^ 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCT 

TGAAGTAATGTAGAGJ^GAAGTTCTCAAATTTGCATATTAOlTCaACTGGAACCAGCAGTGAATCTTAATGTTCAC 

TTAAATO^GAACTTGCATAAGAAAGAGaMGGGAGTCTGGTTAAATAAAGATGACTATATCAGAGACT^ 

GATC^LTTCTCTGTTTTCTGATAGTGTATATGGCCATTTTAGTGGGCACAGATCAGGATTTTTACTIG 

AGTGTCCy^AAACTGCAAGCAGTAGAGAAATAAGACAAGCTTTCaAGAAATTGGCaTTGAAGTTAC^^ 

AAACCCGAATAACCCAAATGCACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGA 

TCTACGGAAAAAGTATGACaAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAGCTGGAA 

CTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGGAAAGAAGAGAATTTGATGC 

TGCTGTTAATTCTGGAGAACTGTGGTTTGTAAATTTTTACTCCCaWSGCTGTTCACACTGCCATGATTTAGCTCC 

CACATGGAGAGACTTTGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAAT 

GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCA6TGAAATA 

TCaTGGAGACAGATCaAAGGAGAGTTTAGTGAGTTTTGCAATGCAGCaTGTTAGAAGTACAGTGACaGAACTT 

GACAGGAAATTTTGTCAACTCCaTACAAACTGCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCAAA 

AGGAGGAGATTGTTT6ACTTCACAGACa.CGACTCAGGCTTAGTGGCATGTTGTTTCTCaACTCa.TTGGA 

AGAAATATATTTGGAAGTAATAGATAATCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT 

GGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTQAGCTGAAAAAACT 

AAAAACTCTACTTAAAAATGATCATATTCAAGTTGGCaGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGT^ 

TCTGTATGTTTTTCAGCCGTCTCTAGCS^GTATTTAAAGGACaAGGAACCaAAGAATATGaAATTC^ 

GAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAA 

TTTTCCTGCCaATGACAAAGAACCATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACT 

ACCAGAGTTACGAAGAGCy^TCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCATGA 

GGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACaGTGGTATTCAACCAGTCCyUiCATTC^ 

TGAAGQACaTCACTCTGCTGAACAAATCTTGGAGTTa^TAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC 

ACCCACCACCTTa^CGAACTAGTTACACaAAGAA?iACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGT^ 

GTGTCATCCTTGCCyVAGTCTTAATGCCIAGaATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 

CAGTATAGATTGCCAACAGTATCATTCTTTTTGTGCCCyiGGAAAACGTTCAAAGATACCCTGAGATAAGATTTTT 

TCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTAa^ATGGTTGGAATAGGGATGCTTATTCCCTGAGAAT 

CTGGGGTCTAGGATTTTTACCTCAAGTATCCACAGATCrAACACCTCAGACTTTCAGTGAAAAAGTTCTAC^ 

gaaaaatcattgggtgattgatttctatgctccttggtgtggaccttgccagaattttgctccaqaatttgagct 
cttggctaggatgattaaaggiy^gtgaaagctggaaaagtagactgtc^ggcttatgctcagacatgccag^ 
agctgggato^gggcctatccaactgttaagttttatttctacgaaagagcaaagagaaattttcaagaagagca^ 
gataaatacca^gagatgcaaaagcaatcgctgccttaataagtgaaaaattggaarctctccgaaatcaaggcy^ 
gaggaataaggatgaactttraataatgttgaagatgaaqaaftaagtttaaaagaaattctgacagatgacatcag 
Aagacacctatttagaatgttacatttatgatgggaatgaatqaacattatcttagacttgcagttgtactgcca 
gaattatctacy^gcactggtgtaaaagaagggtctgcaaactttttctgtaaagggccggtttataaatatttta 
gactttgcaggctataatatatggttcacacatgagaacaagaatagagtcatcatgtattctttgttatttgct 
tttaacaacctttaaaaaatattaaaacgattcttagctcagagccatacaaaagtaggctgqattcagtccatg 

GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 

atctacataaatgtctaagttgtataaagtccactttcccttcacgttttttggctgacctgaaaagaggtaact 
tagtttttggtcacttgttctcctaaaaatgctatccctaaccatatatttatatttcgttttaaaaacacccat 
gatgtggcacagtaaacaaaccctgttatgctgtattattatgaggagattcttcattgttttctttccttctca 
aaggttgaaaaaatgcttttaattttto^cagccgagaaacagtgcagcagtatatgtgcacaa^gtaagtacac 

AAATTTGAGCSy^CAGTAAGTGCACSUUlTTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAA^ 

TAGCAATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 

TGTGTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 

TTACy^TATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGG 

TTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTAT 

AATQATACTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATA 

ACTGAAGTTATTTTTATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 

CTCaAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATT 

CCAATCAGTCAAAAGAGGTCAATGAATTAAAAGGCTTGCaACTTTTTCAAAAAAAZ^AAAAAA^^ 



FIGURE 190 

< /usr / seqdb2 / s s t /DNA/Dnaseqs . min/ ss . DNA5 6439 
<subunit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKL 

HPDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGI 

YDDDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNC 

GDDRMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFYNS 

IQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 

TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 

SLAVFKGQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 

PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 

AEQILEFIEDLMNPSWSLTPTTFNELVTQRKHNEVWiyrVDFYSPWCHPCQVLMPEWKRiy!^ 

liTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 

FLPQVSTDLTPQTFSEKVLQGKNHWVIDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDC 

QAYAQTCQKAGIRAYPTVKFYFYERAKRNFQEEQimi?DAKAIAALISEKLETLRNQGKRNK^ 

Important features: 

Endoplasmic reticulimi targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site. 

amino acids 484-487 



FIGURE 191 



AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGAACCAGAAGGAAGAGGACAGAGCAAA 
GCCATGAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 
GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 
TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 
ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 
ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTGCAGCAACAGAGAAGAGATCTATCGCT 
CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 
GTATATCCAGCCGATCTTCTCAGCACCAAGGATGAAGAGATTACCAAGACATTTGAGGTCAA 
CATCCTAGGACATTTTTGGATCACAAAAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 
GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 
TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 
GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 
AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 
GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 
ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 
TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAATGAATAAATAAGCTCCAGCCAGAGATG 
TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 
TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 
ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 
TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 
TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 
TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 
AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 
ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 
TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 
AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 
TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 
TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 
TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 
CAATGGACCCAAGAGAAGAA 



FIGURE 192 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56409 
<siibunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNI I LE I LLLL ITI I YS YLESLVKFFI PQRRKSVAGE I VLITGAGHGIGRQTTYEFAKRQS I 
LVLWDINKRGVEETAAECRKLGWAHAYVVDCSNREEIYRSmQVKKEVGDVTIVVNNAGW 
YPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC 
S S KFAAVGFHRGLTS ELQALGKTGI KTS CLCPVFVNTGFTKNPSTRLWPVLETDE WRSL ID 
GI LTNKKMI FVPS YINI FLRLQKFLPERAS AI LNRMQNIQFE AWGHKI KMK 

Important features : 
Signal peptide: 

amino acids 1-19 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 

amino acids 165-202, 37-49, 112-122 and 210-219 



FIGURE 193 



CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 
CTAGGCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 
CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
ACCGCCAGCAGCTGGGAGGAG6ACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACTGAAAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 



FIGURE 194 

< /us r/ s eqdb2 / s s t /DNA/Dnaseqs . min/ s s - DNA5 6 1 12 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T) : 4 

MTKARLFRLWLVLGS VFMI LL I I VYWDS AGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTlSrWKRVMIVLS 
GS LI.HRGAPYRDPLRI PREHVHNASAHLTFNKFWRRYGKLSRHmKVKLKKYTKFLFVRDPF 
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETIiDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine- rich region protein 

amino acids 329-332 



"ni 



FIGURE 195 



TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA 
AAGAGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGGGATGGCTACGGGAACGCGCT 
ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 
GCCTTCGTGAACA6CGGGGCCCGAGTGGTTATCTGCGACAAG6ATGAGTCTGGGGGCCGGGC 
CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 
TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 
GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 
GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 
AGAGTCAAGGGAAT6TCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGCA 
GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA 
AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG 
AGGAGCTGGCAGCCTTAATGCCAGACCCTAGGGCCACAATCCGAGAGGGCATGCTGGCCCAG 
CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCTCCGA 
AGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT 
GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTCCTGATTTCTCTCATTT 
CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC 
CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 
AAAAACGATTTGCAGCC 



FIGURE 196 



< /us r / seqdb2 / s s t /DNA/Dnaseqs . min/ s s . DNA5 6045 
<subunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T) : 1 

MATGTRYAGKWWTGGGRGIGAGIVRAFVNSGARWICDKDESGGRALEQELPGAVFILCD 
VTQEDDVKTLVSETIRRFGRLDCWNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site. 

amino acids 138-141 

Short-chain alcohol dehydrogenase family protein 

amino acids 10-22, 81-91, 134-171 and 176-185 



FIGURE 197 



AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGA ATGG ACTGGCCTCACAACCTG 
CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 
GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 
TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATG 
GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 
GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 
CTGTGGGCTGCACCTGCATCTTCTGAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 
CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 
GCAAG 



FIGURE 198 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA59294 
<subunit 1 of 1, 180 aa, 1 stop 
<MW: 20437, pi: 9.58, NX{S/T): 1 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNI EEMVAQLRNSSEIiAQRKCEVNLQLWMSNKRSLSPWGYS INHDPSRI PVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-1.7 

amino acids 96-180. 



FIGURE 199 



GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 
GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 
AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 
CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 
GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 
GAGGAAAAAGATGATTCAAATGAAGAAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 
GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 
CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 
TGTAGATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 
CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 
CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 
GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 
CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 
AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGA?^GCCAAGCAGGAGCCTTGGCT 
CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 
TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 
TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 
CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 
TTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 
AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 
CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 
TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 
GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 
TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGA6 
GCTAATTTCTTT 



FIGURE 200 

< /us r / s eqdb2 / s s t / DNA/Dnas eqs . mi n/ s s . DNA5 6433 
<subunit 1 of 1, 349 aa, 1 stop 
<MW: 38952, pi: 4.34, NX(S/T): 1 

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAP 
WCPSCQQTDSEWEAFAKNGEILQI SVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGI FRRYRG 
PGIFEDLQlSryiLEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE 
EVEPEEAEEGI SEQPCPADTEWEDSLRQRKSQHADKGL 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 



Flavodoxin proteins 

amino acids 173-187 



FIGURE 201 



ATCTGGTTGAACTACTTAAGCTTAATTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATT 
TGACTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCC 
CAAATGCTATATCTATTCAGGGGCTCTCAAGAACA ATGG AATATCATCCTGATTTAGAAAAT 
TTGGATGAAGATGGATATACTCAATTACACTTCGACTCTCAAAGCAATACCAGGATAGCTGT 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 
GAATCCTATGCTTGGTAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AGCCCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 
AAATTCCTGGGATGGAAGTAAAAGACAATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAG 
ACAGCTCAAATGAATTGGGATTTATAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTT 
TGGATAGGCCTTTCTCGGCCCCAGACTGAGGTACCATGGCTCTGGGAGGATGGATCAACATT 
CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAATTGTG 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAGAAGAAGTTTTCAATGTAAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 
AGGAGGACAGAAAACAGAACAGAAAAGAGTAACAGCTGAGGTCAAGATAAATGCAGAAAATG 
TTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGAAGGGAGAGGCTGTGATTTCT 
GTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAGTAGATCCCTAGACATGG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAAT 
ATATTTCTGAATAGCTAAATGAAGAAATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTT 
ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA 
CTTTTTCTTTATCAATTAAATAGCTCAGAGAGTACATCTGCCATATCTCTAATAGAATCTTT 
TTTTTTTTTTTTTTTTTTTGAGACAGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAACGG 
CACGATCTCGGCTCACCGCAACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 
CCCAAGTAGCTGGGATTACAGTCAGGCACCACCACACCCGGCTAATTTTGTATTTTTTTAGT 
AGAGACAGGGTTTCTCCATGTCGGTCAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGC 
CTGCCTCGGCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCAGCCTAGAATCT 
TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
ACCATTTTTTTCAGTAATTACTGTAAAATGGTATTATTGGAATGAAACTATATTTCCTCATG 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
TGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 202 

</usr/seqdb2/sst/DNA/Dnasegs.min/ss.DNA53912 
<siibunit 1 of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX(S/T) : 1 

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCT^SPPWRLIAVILGILCLVILVIAV 
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 
CSVPSYS I CEKKFSM 

Important features: 

Type II transmembrane domain: 

amino acids 45-65 

CAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids 34-67 and 70-200. 



FIGURE 203 



GGAAGGGGAGGAGCAGGCCACaLCAGGCACAGGCCGGTGAGGGACCTGCCCT^Ga^CCTGGAGGGTCT^^ 

Ca.CyiGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTaACCTCC:ACCTCCCGGGTTC^^GTGATTC^ 

TCaGCCTCCCGAGTAGCTGGGATTAC?lGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGRA AATQR CTCCCCAG 

TCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGG 

GAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACa^CACAGGAGCAGCCTCCaOTAaW^CCC^ 

CGC:ATCTCCa.TCGAGAACTCCGAAGAGGCCCTCaLCAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGA 

TCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTC 

TATGGCTIAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTG 

GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCC 

GCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTC 

AAAAGGGACCTCCAGCTGOTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCC^ 

GCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAG 

GAGGACCGGATC»LACGCCACGGTGTGGAAGCTCCa.GCCCAC:AGCCGGCCTCCAGGACCTGCACATCCACTCCCGG 

CaGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGCTGCCTCGAAOlCTCTTCCAGAGGACGaAAGGC 

CGGAGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCC 

AGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACaVGAACACCAAAGTAGCCaACCTCACGGAGCCCGTG 

GTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACA 

TTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGC 

AACCACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGC 

CTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGG 

GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTC 

CTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCC 

ATCTTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCrACCGACTCGTG 

GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGG6CTTCCCCATCTTT 

CTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCSITAGGACTCCAGAG 

GGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGC 

CTGGTGTTTCTGTTCAACATGGCCATGCTAGCaiCCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCC^ 

AAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 

TTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCaTCACCTCCTTCa^GGCTTCCT 

TTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCC 

AGGCTCCCCATCaGCTCGGGCaGCACCTCGTCCAGCCGCATCTAfiGCCTCCAGCCCACCTGCCCA^^^ 

CAGAGATGCGGCCTCGTCGCACACTGCCTGTGGCCCCCGAGCa^GGCCCAGCCCCAGGCCAGTCAGCCGCAGA 

TTGGAAAGCCCa^CGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTG 

GCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTG 

TCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCrTACAACCCCTGGGCCCAGCCCTCATTGCTGGG 

GGCCAGGCCTTGGATCTTGAGGGTCTGGCaLCATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCA 

GTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGG 

CGAATGGGGCCCAGGGCAGACCTTCaGGGCCaGAGCCCTGGCGGAGGAGAGGCCCTTTGCCaGGAGCACaLGCy^ 

AGCTCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTC 

TCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTGGGGA 

GTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCA 

GGCTCGGTACCQATGCGTGGGCTGGGCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTG 

CTCACCCTGACCAAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGA 

Ca^TGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGAOTGAGACTGACCTCCTCTGGTGACACTGGCCTA 

GAGCCTGACACTCTCCTAAGAGGTTCTCTCO^GCCCCCAAATAGCTCCAGGCGCCCTCGGCCGCCCATCATGGT 

TAATTCTGTCCAACAAACyiCACy^CGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTG 

GTCACTCCTCCTGCCS^CATTCAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACA 

GGGAGCCATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGAAGAT 

GGGAAGGATGTTCTTTTTACGTACCaATTCTTTTGTCTTTTGATATTAAA^GAAGTACATGTTCATTGTAGAGA 

ATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAATCAGCTGTTGTAATCGCCTAGCywWiAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAASLAAAAAAAaAAAAAAAAAAAAAAAAAAAAi^AAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA50921 
<subiinit 1 of 1, 693 aa, 1 stop 
<MW: 77738, pi: 8.87, NX(S/T): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATS VTSWWS PQNI SLPSAASFTFSFHS PPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
KVLGIWQNTKYANLTEPVVLTFQHQLQPK]WTLQCVFWVEDPTLSSPGHWSSAGCETVRRE 
TQTSCFClffiLTYFAVmVSSVEVDAVHKHyLSLLSYVGCWSALACLVTIAAyLCSRVPLPC 
RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLE 
GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMWQILRLRPHTQKWSHVLTLLGLSLVLG 
LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 

Important features: 

Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C- terminal targeting signal. 

amino acids 691-693 

cAMP- and cGMP- dependent protein kinase phosphorylation sites, 
amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 
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TGCCTGGCCTGCCTTGTCAA.CAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCAAGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG 
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CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCAGGTTTTGCTTTGA 

TCCTTTTaUiAAACTGGAGACAa^GAAGA.GGGCTCTAGGAAAAAGTTTTGGATGGGATTATGTGGAaACTACCCT 

GCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCa.GTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGAC 

TCGGGAGTCGCTGCTTCCAAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCA AATGA GCCTCTTCGGGC 

TTCTCCTGCTGa^CATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGaATCCAACCTQAGTAGTAAAT^ 

AGTTTTCCAGCAACAAGGAACAGAACGGAGTACaAGATCCTCAGGATGAGAGAATTATTACTGTGTCT^^ 

GAAGTATTCACAGCCCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAQATTAGTAGCA6TAG 

AGGAAAATGTATGGATACAACTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGT 

ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTG6TGTGGTTCTGGTACTGTACCAG 

GAAAACAGATTTCTAAAGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGT 

TCTGCATCCa.CTACAACATTGTCATGCCACS^TTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGC^ 

TGCCACTGQACCTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTATTCGATATCTTGAACCAG 

AGAGATGGCAGTTGGACTTAGAA6ATCTATATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAA 

GAAAATCOiGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACaGCTGCACACCTCGTAACTTCT 

CAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTG 

GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 

ACGAGGTCCTTCAGTTQAGACCa^GACCGGTGTCAGGGGATTGCACaAATa^CTCACCGACGTGGCCCTGt^ 

ACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGC^ 

GAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGOT 

TC^GGACCTTTCATCTTOVGGATTTACAGTGaVTTCTGSU^GAGGAGACATCaAACAG 

ACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCaATCGTGGAAAGAAAATTAAATGTTGTAT 
TAAATAGATc:a.CCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTC 
GATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 
TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCACAT 
ATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAftACTTGT 
GTCATGCTGATAGGACaGACTGGAlTTTTCATATTTCTTATTAAAATTTCTGCCTlTTTAGAAGAAGAGAACTACA 
TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTG 
TTTCATTGTGTACaTTTTTATATTCTCCTTTTQACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 
ATTTTTACCAAAGGTATTTAATATTCTTTTTTATQACS^CTTAaATCAACTATTTTTAGCTTGGTAAATTTTTCT 
AAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 
TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 
GACTTTTTGAAAATAATTAAATTATCATATCTTCCaTTCCTGTTATTGGAGATGaAAATAAAAAGCAACTTATGA 
AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTQAGCCTAGCTCAGAAAAACAT 
AAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTA 
TTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACATGGATATTTTTATGTACA 
GAAGTATGTCTCTTAACaVGTTCACTTATTGTACTCTGGCAATTTAAAAGAAAATCAGTAAAATATTTTGCTTGT 
AAAATGCTTAATATNGTGCCTAGGTTATGTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGA 
ATGTGGCTATTTTGGGGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 
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MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERI ITVSTNGS IHSPR 
FPHTyPRNTVLWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTIIiGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLY 
SCTPRNFSVS IREELKRTDTI FWPGCLLVKRCGGNCACCLHNCNECQCVPS KVTKKYHEVLQ 
LRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



Signal sequence: 

amino acids 1-14 
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CCC^lTCTCAAGCTGATClTGGCACCTCTCaLTGCTCTGCTCTCTTCaACC^ 

AGACTAAAAATGGTGTTTCCaATGTGGACACTGJ^GAGAGaaATTCTTATCCTTTTTAACAT^ 

AaACTCCTTGG6GCTAGATGGTTTCCTA2^CTCTGCCCTGTGATGTC::ACTCTGGATGTTCCAAAGAACCATGTG 

ATCGTGGACTGCaa^GACSUlGCATTTGACAGAAATTCCTGGAGGTATTCCCaCGAACACCACGAACCTC^ 

ACC^iTTAACCACATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACCaTCTGGTAGAGATCGATO 

TGCAACTGTGTACCTATTCCACTGGGGTCAAAAAACAACATGTGCATCAAGAGGCTGCAGATTAA^ 

TTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACCAGCTACTAGAGATACCGCAGGGCCTCCCG 

CCTAGCTTACAGCTTCTCaGCCTTGAGGCCAAa^CATCTTTTCCATCAGAAAAGAGAATCTAACAGAACTGGCC 

AACATAGZ^TACTCTACCTGGGCCAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAA 

GATGCCTTCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCTACTGTT 

TTGCCATCTACTTTAACAGAACTATATCTCTACAACAACa^TGATTGCaAAAATCCAAGAAGATGATTTT 

CTO^CCaATTACAAATTCTTGACCTAAGTGGAAATTGCCCTCGTTGTTATAATGCCCCATTTCCTTGTGCGCCG 

TGTAAAAATAATTCTCCCCTACAGATCCCTGTAAATGCTTTTGATGCGCTGACAGAATTAAAAGTTTTACGTCTA 

CACAGTAACTCTCTTCaGCATGTGCCCCCAAGATGGTTTAAGAACATCAACAAACTCCAGGAACTGGATCTGTCC 

CAAAACTTCTTGGCCAAAGAAATTGGGGATGCTAAATTTCTGCaTTTTCTCCCCAGCCTCATCCaATTGGAT^ 

TCTTTCaiATTTTGAACTTCAGGTCTATCGTGCaTCTATGAATCTATCyVC^GCATTTTCTTCACTGAAAAGCCTG 

AAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGAAAAGCTTTAACCTCTCGCCATTACATAATCTTCAA 

AATCTTGAAGTTCTTGATCTTGGCACTAACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGA 

CTGAAAGTCATAGATCTTTCAGTGAATAAAATATCACCTTCAGGAGATTCAAGTGAAGTTGGCTTCTGCTCaAAT 

GCCAGAACTTOTGTAGAl^GTTATGAACCCCAGGTCCTGGAACAATTACATTATTTCAGATATGATAAGTATG^ 

AGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCaTGTCTGTTAATGAAAGCTGCTACaAGTATGGGC^ 

TTGGATCTAAGTAAAAATAGTATATTTTTTGTCAAGTCCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTG 

AATCTGTCAGGAAATCTCATTAGCCaAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCTGAGATATTTG 

GACTTCTCCa^CAACCGGCTTGATTTACTCCaTTCAACAGGATTTGAAGAGCTTCACAAACTGGA^ 

ATAAGCAGTAATAGCCATTATTTTCaATCAGAAGGAATTACTCATATGCTAAACTTTACCAAGAACCTAAAGGTT 

CTGCAQAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCAGCAGGACCATGGAGAGTGAGTCTCTTAGA 

ACTCTGGAATTCAGAGGAAATCaLCTTAGATGTTTTATGGAGAGAAGGTGATAACAGATACTTACAATTATTCAAG 

AATCTGCTAAAATTAGAGGAATTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGT 

ATGCCTCCAAATCTAAAGAATCTCTCTTTGGCCaAAAATGGGCTCS^TCTTTCAGTTGGAAGAAACTCCaGTO 

CTAAAQAACCTGGaAACTTTGGACCTOiGCCaCAACCaACTGACCACTGTCCCTGAGAGATTATCCAAOT 

AGAAGCCTO^GAATCTGATTCTTAAQAATAATCAAATCAGGAGTCTGACGAAGTATTTTCTACAAGATGCCTTC 

CAGTTGCGATATCTGGATCTCT^GCTCAAATAAAATCCAGATQATCCAAAAGACCAGCTTCCCAGAAAATGTCCTC 

AACAATCTGAAGATGTTGCTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGG 

GTTAACCaTACGOAGGTGACTATTCCTTACCTGGCCACaGATGTGACTTGTGTGGGGCCAGGAGCACAC^GGGC 

CAAAGTGTQATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTGATTCTGTTCTCACTTTCCATA 

TCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCACCTCTATTTCTGGGATGTGTGGTATATTTACCAT 

IT'CTGTAAGGCCAAGATAAAGGGGTATCAGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTAT 

GACACTA?AGACCCAGCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAA 

CATTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTTCCCAGAGCATA 

CAGCTTAGCAAAAAGACAGTGTTTGTGATGACAGACAAGTATGCAAAGACTGAAAATTTTAAGATAGC^ 

TTGTCCCATCAGAGGCTO^TGGATGAAAAAGTTGATGTGATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAG 

TCCy^GTTCCTCaiGCTCCGGAAAAGGCTCTGTGGGAGTTCTGTCOTTGAGTGGCCAACAAACCCGCAAGCTC^^ 

CCATACTTCTGGCAGTGTCTAAAGAACGCCCTGGCCACAGACS^TCaTGTGGCCTATAGTCAGGTGTTCAAa 

ACGGTCTAGCCCTTCTTTGCAAAACAa^CTGCCTAGTTTACCAAGGAGAGGCCT 
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MVFPMWTLiaiQILILFNIILISKLLGARWFPKTLPCDVTLDVPKliraVIVDCTDKHLTEIPGG 

IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFS 

GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 

NPCYVSYSIEKDAFLNLTKLKVLSLKDmrVTAVPTVLPSTLTELYLYmMIAKIQEDDFm 

NQLQILDLSGNCPRCYNAPFPCAPCKDJNSPLQIPVNAFDALTELKVLRLHSNSIiQHVPPRWF 

KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 

KILRIRGYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDL.SVNKIS 

PSGDSSEVGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSWESCYKYGQ 

TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 

STAFEELHKLEVLDISSNSHYFQSEGITHMmFTKNLKVLQKLMMNDNDISSSTSRTMESES 

LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNIiSL 

AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 

DAFQLRYLDLSSNKIQMIQKTSFPENVLNNLKMLLLHHNRFLCTCDAWFVWWVNHTEVTIP 

YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVW 

YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWLAELVAKLEDPREKHFNIiCLEE 

RDWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 

KPFQKSKFLQLRKRIiCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV 

Signal sequence: 

amino acids 1-26 

Transmembrane domain: 

amino acids 840-860 
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GGGTACCa^TTCTGCGCTGCTGCaAGTTACGGAATGgVAAAATTAGAACAACAGAaA Cft.TGG AAAA^ 

AGTCGTCAATGCTGACCTGCATTTTCCTGCTAATATCTGGTTCCTGTGAGTTATGCGCCGAAGAAAATTTTTCTA 

GAAGCTATCCTTGTGATGAGAAAAAGCaAAATGACTCAGTTATTGCAGAGTGCAGCAATCGTCGACTACAGGAAG 

TTCCCCAAACGGTGGGCAAATATGTGACaGAACTAGACCTGTCTGATAATTTCATCAGi^CACATAACGAATG^ 

CATTTCAAGGGCTGCAAAATCTCACTAAAATAAATCTAAACCACAACCCCAATGTACAGCACCAGAACGG^^ 

CCGGTATACAATCAAATGGCTTGAATATCACS^GACGGGGCaTTCCTC2^CCTAAAAAACCTAAGGQAGT^ 

TTGAAGAOVACCAGTTACCCCAAATACCCTCTGGTTTGCCAGAGTOTTTGACAGAACTTAGTCTAATTC^^ 

ATATATACAACATAACTAAAGAGGGCATTTCAAGACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACTGCT 

ATTTTAACAAAGTTTGCGAGAAAACTAACATAGAAGATGGAGTATTTGAAACGCTGACAAATTTGGAGTTGCTAT 

CACTATCTTTCAATTCTCTTTCaCACGTGCCaCCCAAACTGCCaAGCTCCCTACGCaAACTTTTTCTGAGC^ 

CCCAGATCAAATACATTAGTGAAGAAGATTTCAAGGGATTGATAAATTTAACATTACTAGATTTAAGCGGG^ 

GTCCGAGGTGCTTCy^TGCCCCATTTCCATGCGTGCCTTGTGATGGTGGTGCTTCAATTAATATAGATCGTTTTG 

CTTTTCAAAACTTGACCCaACTTCGATACCTAAACCTCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGT 

TTAAAAATATGCCTCATCTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCAT 

TTTTAACGATGCTGCCCCGCTTAGAAATACTTGACTTGTCTTTTAACTATATAAAGGGGAGTTATCCACAGCATA 

TTAATATTTCCAGAAACTTCTCTAAACTTTTGTCTCTACGGGCATTGCATTTAAGAGGTTATGTGTTCCAGGAAC 

TCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAACTTATCGACTATCAACTTGGGTATTAATTTTATTA 

AGCL?^AATCGATTTCAAACTTTTCCAAAATTTCTCCAATCTGGAAATTATTTACTTGTCAGAAAACAGAATA^ 

CGTTGGTAAAAGATACCCGGCAGAGTTATGCAAATAGTTCCTCTTTTCAACGTCATATCCGGAAACGACGCTCAA 

CAGATTTTGAGTTTGACCCACATTCGAACTTTTATCaTTTCACCCGTCCTTTAATAAAGCCa^CAATGTGCTGCTT 

ATGGAAAAGCCTTAGATTTAAGCCTCASLCaGTATTTTCTTCy^TTGGGCCAAACCaATTTGAAAATCTTCCTC^ 

TTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCAAGTGTTAAGTGGAACTGAATTTTCAGCCATTCCTCATG 

TCaAATATTTGGATTTGACaAACAATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGTCCGACTTGG 

AAGTTCTAGATCTCAGCTATAATTCACACTATTTCAGaATAGCAGGCGTAACACATCaTCTAGAATTTATTC^ 

ATTTCa^CAAATCTAAAAGTTTTAAACTTGAGCCACaACAACaiTTTATACTTTAACaGATAAGTATAJlCC^ 

GCAAGTCCCTGGTAGAATTAGTTTTCAGTGGCAATCGCCTTGACATTTTGTGGaATGATGATGACAACAGGTATA 

TCTCCATTTTCAAAGGTCTCAAGAATCTGACACGTCTGGATTTATCCCTTAATAGGCTGAAGCACaTCCCAAATG 

AAGCATTCCTTAATTTGCCAGCGAGTCTCACTQAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGA 

CATTACTCCAGCAGTTTCCTCGTCTCGAGTTGCTTGACTTACGTGQAAACAAACTACTCTTTTTAACTGATAGCC 

TATCTGACTTTACATCTTCCCTTCGGAGACTGCTGCTGAGTCATAACAGGATTTCCCACCTACCCTCTGGCTTTC 

TTTCTGAAGTa^GTAGTCTGAAG<:a.CCTCGATTTAAGTTCCaATCTGCTAAAAACAATCAACAAATCCGCaCT 

AAACTAAQACCACCACCAAATTATCTATGTTGGAACTACACGGAAACCCCTTTQAATGCACCTGTGACATTGGAG 

ATTTCCGAAGATGGATGGATGAACATCTGAATGTCAAAATTCCCAGACTGGTAGATGTCS^TTTGTGCCaLGTCCTO 

GGGATCAAAGAGGGAAGAGTATTGTGAGTCTGGAGCTAACAACTTGTGTTTCAGATGTCACTGCAGTGATATTAT 

GGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTACAGGTCTCTTTCCACATCCCAAACTTTCTATGATG 
CTTACATTTCTTATGACACCAAAQATGCCTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAG 
AGAGCCXSAGACaAAAACGTTCTCCTTTGTCTAGAGGAGAGGGATTGGGACCCGGGATTGGCCATCaTCGACAACC 
TCATGCAGAGCATCaACCBAAGCAAGAAAACAGTATTTGTTTTAACCAAAAAATATGC^^ 

AAACAGCTTTTTACTTGGCTTTGCAGAGGCTAATGGATGAGAACATGGATGTGATTATATTTATCCTGCTGGAGC 

CaGT6TTACAGaVTTCTCAGTATTTGAGGCTACGGCAGCGGATCTGTAAGA6CTCCATCCTCC:a.GTGGCCTGAC^ 

ACCCGAAGGCAGAAGGCTTGTTTTGGCSUU^CTCTGAGAAATGTGGTCTTGACTGAAAATGATTOICGGTATAACA 

ATATGTATGTCGATTCCATTAAGCAATACTAACTGACGTTAAGTCATGATTTCGCGCCATAATAAAGATGCAAAG 

GAATGACATTTCTGTATTAGTTATCTATTGCTATGTAACy^AATTATCCC?\AAACTTAGTGGTTTAAAACAACAC^ 

TTTGCTGGCCCACAGTTTTTGAGGGTCAGGAGTCCAGGCCCAGCATAACTGGGTCCTCTGCTCAGGGTGTCTC^ 

AGGCTGCAATGTAGGTGTTCACCAGAGACATAGGCaT(:a^CTGGGGTCACACTCATGTGGTTGTTTTC 

ATTCCTCCTGGGCTATTGGCCAAAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTC 

ATCAGAGCTAGCaAAAAAGAGAGGTTGCTAGCAAGATGAAGTCACAATCTTTTGTAATCGAATCaA 

lATCTG^TCACTTTGGCCATATTCTATTTGTTAGAAGTAAACCACaGGTCCCACCAGCTCCATGGGAGTGAC^ 

TCAGTCCAGGGAAAACAGCTGAAGACCAAGATGGTGAGCTCTGATTGCTTCAGTTGGTCATCAACTATTTTCCCT 

TGACTGCTGTCCTGGGATGGCCTGCTATCTTGATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACC 

ATCTTAGCAGTTGACCTAACACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATA 

TTAAGCTGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCGGTTTTAT 

TTACAGTTGCTTTTACAAATATTTGCTGTAACATTTGACTTCTAAGGTTTAGATGCCATTTAAGAACTGAGATGG 

ATAGCTTTTAAAGCATCTTTTACTTCTTACCATTTTTTAAAAGTATGCAGCTAAATTCGAAGCTTTTGGTCTATA 

TTGTTAATTGCCATTGCTGTAAATCTTAAAATGAATGAATAAAAATGTTTCATTTTACAAAAAAAAAAAAAAAA 
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MENMPLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 
KYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL. 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 
CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 
TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 
PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFI 
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 
LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 
DILWNDDDNRYI S I FKGLKNLTRLDLSLNRLKHI PNEAFLNLPASLTELHINDNMLKFFNWT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCD IGDFRRWMDEHLNVKI PRLVDVI GASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMLAALAHHLFYWDWFIYNVCLAKVK 
GYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NLMQS INQS KKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVI I PI LLEPVLQHSQYLRL 
RQRI CKS S I LQWPDNPKAEGLFWQTLRNWLTENDSRYNNMYVDS IKQY 

Signal sequence: 

amino acids 1-26 

Transmembraxie domain: 

amino acids 826-848 



FIGURE 212 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 
GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
^^^^^^^^^^^^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAftAAAAAGGGCGGCCGCGACTCTAGAGT 
C6ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 



tmrm 



FIGURE 213 



MRGSQEVLLMWLLVIAVGGTEHAYRPGRRVCAVIL?mGDPVSESFVQRVYQPFLTTCDGHRA^ 
STYRTiyRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEE VQRLQSRVDIiLEEKLQLVLAPLHSLASQALEHGLPDPGS LLVHS FQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 

1-19 



FIGURE 214 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG 
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 
TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGC 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 
TCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 
ACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 
CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG 



FIGURE 215 



MRGSQEVLLMWLLVIAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
C PAGWRGDTCQSDVDECSARRGGCPQRC INTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQr.VLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 
1-19 



FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 
GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 
TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 
CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 
CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 
CTGGAGGCACAGGCCATGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 
AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 
GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 
GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 
GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 
CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 
CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 
ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 
GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 
CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 
GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 
GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 
TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 
GGGGTCCTGCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 
TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 
GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 
GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 
CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 
GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 
CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTG 



FIGURE 217 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSIASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 



Signal sequence: 

1-19 



FIGURE 218 

GGTTGCCACAGCTGGTTTAGGGCCCCQACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTCCCGGCCCGGGGAG 
GACAAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTCCGTTTCCTGCCGTCAGCTGCCGGCCG 
AGTTGGGTCTCCGTGTTTCA.GGCCGGCTCCCCCTTCCTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGG 
AGATTGTCTTCCAGGGCTAGCaATTGGACTTTTGATGATGTTTGACCa^GCGGCAGGAATAGCAGGCA^ 
TTCAAAGCTGGGCTCSiGCCTCTGTTTCTTCTCTCGTGTAATCGCaAAACCCaTTTTGGAGCAGGAATTCCAATCA 
TGTCTGTGATGGTG6TGAGAAAGAAGGTGACACGGAAATGGGAGAAACTCCCAGGCAGGAACACCTTTTGCTGTG 
ATGGCCGCGTCATGATGGCCCGGCAAAAGGGCATTTTCTACCTGACCCTTTTCCTCATCCTGGGGACATGTACAC 
TCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTGTTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCT 
TCCTTTTCTCCATGGCTACACTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATG 
AAGCAGCTTTOlTAGAAATGGAGATAGAAGCTACCaATGGTGCGGTGCCCCAGGGCCaGCQACCACCGCCTCGTA 
• TO^GAATTTCCAGATAAACSiACa^GATTGTGAAACTGAAATACTGTTACACATGa^GATCTTCCGGCCTC 
GGGCCTCCCATTGCAGCATCTGTGACAACTGTGTGGAGCGCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTG 
TTGGAAAGAGGAACTACCGCTACTTCTACCTCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCT 
TCAACATCGTCTATGTGGCCCTCAAATCTTTGAAAATTGGCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTG 
TTCTAGAAGTCCTCATTTGCTTCTTTACACTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTTTCCTCGTGG 
CTCTCAACCAGACAACCaATGAAGACATCaAAGGATCATGGACAGGGAAGAATCGCGTCCAGAA 
ATGGCAATATTGTOAAGAACTGCTGTGAAGTGCTGTGTGGCCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTA 
TTTTGCCACTGQAGGAAAGTGGAAGTCGACCTCCCAGTACTC^^GaGACCAGTAGCAGCCTCTTGCCaCAGAGCC 
CAGCCCCCACAGAACACCTGAACTCAAATGAGATGCCGGAGGACAGCa.GCACTCCCGAAGAGAT 
AGCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAGAAGTAGCCTATCTATGGAAGAGACTTTTGTTTGTGTT 
TAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAGAGCAAGTAAGCTGTCCCTTTTAACT 
GTTTTTCTTTGGTCTTTAGTCACCCAGTTGCa^CACTGGCATTTTCTTGCTGCaAGCTTTTTTAAA 
CAAGGCAGTGGCAGAAGATGTCAGTCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCCTGGCACTGGTTCT 
CCATGGCCTCAGCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTCCAGATCCCAGCCCTCCTGCTTGGGGTCAC 
TGGTCTaVTTCTGGGGCTAAAAGTTTTTGAGACTGGCTCA?iATCCTCCCAAGCTGCTGCy^CGTGCTGAGTCC^ 
GGCAGTCa.C7VQAGACCTCTGGCaiGGGGATCCTAACTGGGTTCTTGGGGTCTTCAGGACTGAAGAGGAGGGAGAG 
TGGGGTCAGAAGATTCTCCTGGCCACCaAGTGCCAGCATTGCCCa-CAAATCCTTTTAGGAATGGGACAGGTACCT 
TCCACTTGTTGTAKnsnmffinnmiS^^ 

CAGGAATGGCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACTTAAAC 
ACTATCCCCTCaGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGGAAACCAAGGCACAGAGAG 
GCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAAAAAAATGCTAACCAGTTCTTCCATTAAGCCT 
CGGCTGAGTGAGGQAAAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCT 
ATGGTAACCACACTGGGGGCTTCCTCCaAGCCCCGCTCTTCCAGCACTTCaiCCGGCAGAGTCCCAGAGCCACTT 
CACCCTGGGGGTGGGCTGTGGCCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAAGATTTATGT 
ATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCTGGATGACTTAT 
GCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 



FIGURE 219 



MSVMVVRKKVTRKWEKLPGRNTFCCDGRVMMARQKGIFYLTLFLILGTCTLFFAFECRYI^ 

QIiSPAIPVFAAMLFIiFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPP 

RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL 

FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 

LVALNQTTNED I KGSWTGKNRVQNPYSHGNI VKNCCEVLCGPLPPSVLDRRGI LPLEESGSR 

PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 



Putative transmembrane domains: 

amino acids 36-55 (type II TM) , 65-84, 188-208, 229-245 



FIGURE 220 



AAAACCCTGTATTTTTTACAATGCAAATAGACAATNANCCTGGAGGTCTTTGAATTAGGTAT 
TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
CCCACAGAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTA 
CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 



FIGURE 221 



GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 

TCTATCTIGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGA 

AAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCJ^ 

GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 

CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 

GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 

CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 

AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 

CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 

TTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 

CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 

GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 

CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 

TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 

TCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 

GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 

AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 

ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAG6AGGGCAGG 

CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTTTGATGTGAGTGCC 

ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 

CGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGA 

AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 

AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 

ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 

GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 

AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 

GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 

AAAAA 



FIGURE 222 



MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNR 
VTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL 
EIQGITREQSGDYECSASNDVAAPWRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV 
PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIML 
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 

Signal peptide; 

amino acids 1-28 



FIGURE 223 



GAAAAAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTC 

ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 

CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 

TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT. 

GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 

CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 

ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 

TCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 
ACCAGAG 



FIGURE 224 



ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGA 

ATCACCGCCTGGCCCGA.CTCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAAC^^ 

i^GGGQAOlAGACAGCTGTTAGGCTCaCGa^CGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTG^ 

GCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGCCTTACA 

GAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTTTAC 

CAGTTOTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGC 

CTCTGGGACCAi^CCAGGCCATACTGAAGCACCTGCTTQAAAACaCCaCCTTCy^ 

cagaagao^cagcgcttctacctatcttgcctacaggtggagcgcy^ttgaggagctgggagcccagccslctgac^ 

gacctcattgagaagattggtggttggaa(:a.ttacggggccctgggaccaggacaactttatggaggtgttgaag 

gcagtagcagggacctacagggccacccc:a.ttcttca.ccgtctacatcagtgccgactctaagagttccaaai 

aatgttatccaggtggaccagtctgggctctttctgccctctcgggattactacttaaacagaactgccaatgag 

aaagtgctcact6cctatctggattacatggaggaactggggatgctgctgggtgggcggcccacctccacgagg 

gagcagatgcagcaggtgctggagttggagatacagctggccaacatcacagtgccccaggaccagcggcgcgac 

gaggag?uigatctacc:acaagatgagcatttcggagctgcaggctctggcgccctccatggact<^ 

ctgtctttcttgctgtcaccattggagttgagtgactctgagcctgtggtggtgtatgggatggattatttgcag 

caggtgtcyvgagctca.tcauiccga^cggaaccaagcatcctgaacaattacctgatctggaacct 

acaacctcaagcctggaccgacgctttgagtctgcacaagagaagctgctggagaccctctatggcactaag^ 

tcctgtgtgccgaggtggcagacctgcatctccaacacggatgacgcccttggctttgctttggggtcactcttc 

gtgaaggccacgtttgaccggcaaagcaaagaaattgcagaggggatgatcagcgaaatccggaccgcatttgag 

gaggccctgggacagctggtttggatggatgagaagacccgccagggagccaaggagaaagcagatgccatctat 

gatatgattggtttcccagactttatcctggagcccaaagagctggatgatgtttatgacgggtacgaaatttct 

gaaqattctttcttccaaaacatgttgaatttgtacaacttctotgccaaggttatggctgaccagctccgc^ 

cctcccagccgagacc^gtggagcatgaccccccagacagtgaatgcctactaccttccaactaagaatgagatc 

gtcttccccgctggoitcctgcaggcccccttctatgcccgcaaccaccccaaggccctgaacttcggtggcatc 

ggtgtggtcaitgggccatgagttgacgcatgcctttgatqaccaagggcgcgagtatgacyu^gaagggaacctg 

cggccctggtggcagaatgagtccctggcagccttccggaaccacacggcctgcatggaggaacagtacaata^ 

taccaggtcaatggggagaggctcaacggccgccagacgctggggqagaacattactgacaacggggggctgaag 

GCTGCOTAO^ATGCTTACRAAGCATGGCTGAGaAAGCATGGGQAGQAGCAGCAACTGCCyiG^^^ 

AACCACCAGCTCTTCTTCGTGGGATTTGCCCAGGTGTGGTGCTCGGTCCGCACaCCAQAGAGCrCTCACGAGGGG 

CTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGG 

CACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACCTGGATCa^GGGG^ 

GAAATGGCCAGCTGTCACCAGACCTGGGGCaGCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAA 

ATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACAGGTGACATGAGTACAGACCCTCCTCAATCACCACATTG 

TGCCTCTGCTTTGGGGGTGCCCCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCT 

GCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 



FIGURE 225 



MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSH 
STCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 
FMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAY 
LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP 
SMDWLEFLSFLLSPLELSDSEPVWYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL 
DRRFESAQEKLLETLYGTKKSCVPRWQTC I SNTDDALGFALGSLFVKATFDRQS KE I AEGMI 
SEIRTAFSEALGQLVWMDEKTRQAAKEKADAIYDMIGFPDFILEPKELDDVYDGYEISEDSF 
FQNMLNLYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH 
PKALNFGG IGWMGHELTHAFDDQGREYDKEGNLRPWWQNE S LAAFRNHTACMEEQYNQYQV 
NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCS 
VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 



Type II Transmembrane domain: 

amino acids 32-57 



FIGURE 226 



GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTCCCTCCCTCCTCCC 

CAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCGCTGCTGCTCCTCGGGCTGCTGCTGCT 

CGGCTCCCGGCCGGCCCGCGGCGCCGGCCCAGAGCCCCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCC 

CGTTCGGGGAGCGGCAGGTAGGTGGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGC 

CAGCCCGGAGGGGGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAGGGC 

GCGGTGCCTGGGACCCGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTGGGCAGCGGCCTCCAGC 

CAAGCCCGTCCCCGCAGGCTGCACCTTCGGCGGGAAGGTCTATGCCTTGGACGAGACGTGGCACCCGGACCTAGG 

GGAGCCATTCGGGGTGATGCGCTGCGTGCTGTGCGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGG 

CAGGGTCAGCTGCAAGAACATCaAACCAGAGTGCCaiACCCCGGCCTGTGGGCAGCCGCGCaSiGCTC^ 

CTGCTGCCAGACCTGCCCCCAGGACTTCGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACGAGCCCG 

AGTCTCGCTGCTGCGCTCTAGCCTCCGCTTCTCTATCTCCTACAGGCGGCTGGACCGCCCTACCAGGATCCGCTT 

CTCAGACTCCAATGGCAGTGTCCTGTTTQAGCACCCTGCAGCCCCCACCCAAGATGGCCTGGTCTGTGGGGTGTG 

GCGGGCAGTGCCTCGGTTGTCTCTGCG6CTCCTTAGGGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCA 

CCCTTCAGGGGAGGTCTGGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCaGAGACCTTCAGTGCCATCCTGAC 

TCTAGAAGGCCCCCB.CCy^GCaGGGCGTAGGGGGCATCaCCCTGCTCACrCTCAGTQACACAGA 

TTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGCTCCAGATTCTACACCAGGGGCA 

GCTACTGCGAGAACTTCAGGCCAATGTCTCAGCCCAGGAACCAGGCTTTGCTGAGGTGCTGCCCAACCTGACAGT 

CCAGGAGATGGACTGGCTGGTGCTGGGGGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCyiGGGCTGCGCAT 

CAGTGGACTVCATTGCTGCCAGGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTQATCCC 

AGTCCaAACGGGTGCTGCCGGCTCaiGCCaGCCTCACTCTGCTAGQAAATGGCaiCCCTGATCCTCCAGGTGCT^ 

GGTAGGGACAACG^GTGAGGTGGTGGCCATQACACTGGAAACCAAGCCTaiGCGGAGGGATCAGCCCACTGT^^^ 

GTGCCACATGGCTGGCCTATCCTCCCCTGCCCCCAGGCCGTGGGTATCTGCCCTGGGCTGGGGTGCCCGAGGGGC 

TCATATGCTGCTGCAGAATGAGCTCTTCCTGJ^CGTGGGCACCAAGGACTTCCCAGACGGAGAGCTT 

ACGTGGCTGCCCTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTACC 

CCCTGTGAAGAGCCAAGC7s.GCAGGGCACGCOTGGCTTTCCTTGGATACCCACTGTCaCCTGCaCTATGAAGTGCT 

GCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACCTCCTTGGGCCTCCTGGAACGCCAGGGCC 

TCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAGGCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCG 

G<^CCTGGCAA2yVGGCaTGGCTTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGC^^ 

CTCCTCCCAGGTGCACATAGCCAACCaATGTGAGGTTGGCGGACTGCGCOTGGAGGCGGCCGGGGCCGAGGGGGT 

GCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCTCCCGGCCCTAGCGCCCGC 

CAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACaCATGCTTCTTCGAGGGGCyVGCyVGCGCCCCCACG^ 

TCGCTGGGCGCCCAACTACGACCCGCTCTGCTCACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCCGGT 

GGTGTGCCCACCGCCCaGCTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTA 

TTTTGATGGTGACCGGAGCTGGCGGGCAGCGGGTACGCGGTGGCACCCCGTTGTGCCCCCCTTTG6CTTAATTAA 

GTGTGCTGTCTGCACCTGCAAGCAGGGGGGCACTGGAGAGGTGCACTGTGAGAAGGTGCAGTGTCCCCGGCTGGC 

CTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTGCAAACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGG 

GGACCCCATGCAGGCTGATGGGCCCCGGGGCTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGCA 

CCCCTCAGTGCCCCCGTTTGGAGAGATGAGCTGTATCACCTGCAQATGTGGGGTAAGTGGGGAGCAGAGGCTTGT 

GTGAGGTGGGTACTGGGAGCCTGGTCTGGAGTAGGGAGACCTTCCCAGGGAGGTCCCTGAAGAAGCTGAAGGTCA 

CTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCrGTCTTGTACCAGGCAGGGGTGCCTCACTGTGAGC 

GGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGGAAGQAGAGTCGATGCTGTTCCCGCTGCACGGCCCACC 

GGCGGCGTAAGTGAGGGAGTCCAGGGTCAGCAGCTGTGAGTGGAGGGCTCACCTGCCTGTGGQACTCCTGATCAG 

GGAAGGGAGCACTCACTGTGTGCAGGAACAGTGCAGCCTGCCTCACAAGTGCCATTCCAATCCACCCTCACAGCA 

ACCTGGTGG?UkTTGTTATTTATGACCTTTTCTTTAa^TGAGATTTCTGAAGCTCAGAGAAATTAAGa^CG^^ 

ATGAAGGTCACCCAGCTGTGTGCACTGACCTGTTTAGAAAATACTGGCCTTTCTGGGACCAAGGCAGGGATGCTT 

TGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCACTCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCC 

AGCAGCCCCAGAGACCAGAACTGATCCAGAGCTGGAGAAAGAAGCCGAAGGCTCTTAGGGAGCAGCCAGAGGGCC 

AAGTGACCAAGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGGGAAG 

GAGAGGCAGCTGGGCCAQACCGAGGTCACAGCCACTCCAAGTCCTGCCCTGCCACCCTCGGCCTCTGTCCTGGAA 
GCCCCACCCCTTTCTTCCTGTACATAATGTCACTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAG 
GGCCCCGGACACTCCACTCCTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAAC 
ATTTCTTTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGGAGAAGGG 
GCNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACACTTGGGGAGCTGaAAGAGACCTGGAGAGGCAGAGGATAG 
CGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTGGGGATGGTTCTTGAGATGGTCTAGAGACTCAAG 
AATTTAGGGAAGTAGAAGCAGGATTTTGACTCAAGTTTAGTTTCCCACATCGCTGGCCTGTTTGCTGACTTCATG 
TTTQAAGTTGCTCCAGAGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCT 
TTCCCTCCCCTCCCCTCCCCTCCCCTCCCCTCC 



FIGURE 227 



GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 
CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 
TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 
GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 
TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 
TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 
GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 
GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 
CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 
GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 
AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 
GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 
CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 
GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 
ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 
TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 
CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 
TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 
GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 
AGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 



FIGURE 228 



ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 
TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 
TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 
CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 
TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 
TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 
GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 
GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 
CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 
ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 
TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 
TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 
GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 
GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 
GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 
CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 
TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 
TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 
CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 
TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 
GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 
GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 
TGGCCCTTATAGGACC 



FIGURE 229 

CCCACGCGTCCGAGCCGCCCGAGAATTAGACACACTCCGGACGCGGCCAAAAGCAACCGAGA 
GGAGGGGAGGCAAAAACACCGAAAAACAAAAAGAGAGAAACAACACCCAACAACTGGGGTGG 
GGGGAAGAAAGAAAGAAAAGAAACCCACCCACCCACCAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAATCCTGTGGCGCGCCGCCTGGTTCCCGGGAAGACTCGCCAGCACCAGGGGG 
TGGGGGAGTGCGAGCTGAAAGCTGCTGGAGAGTGAGCAGCCCTAGCAGGGATGGAC ATGA TG 
CTGTTGGTGCAGGGTGCTTGTTGCTCGAACCAGTGGCTGGCGGCGGTGCTCCTCAGCCTGTG 
CTGCCTGCTACCCTCCTGCCTCCCGGCTGGACAGAGTGTGGACTTCCCCTGGGCGGCCGTGG 
ACAACATGATGGTCAGAAAAGGGGACACGGCGGTGCTTAGGTGTTATTTGGAAGATGGAGCT 
TCAAAGGGTGCCTGGCTGAACCGGTCAAGTATTATTTTTGCGGGAGGTGATAAGTGGTCAGT 
GGATCCTCGAGTTTCAATTTCAACATTGAATAAAAGGGACTACAGCCTCCAGATACAGAATG 
TAGATGTGACAGATGATGGCCCATACACGTGTTCTGTTCAGACTCAACATACACCCAGAACA 
ATGCAGGTGCATCTAACTGTGCAAGTTCCTCCTAAGATATATGACATCTCAAATGATATGAC 
CGTCAATGAAGGAACCAACGTCACTCTTACTTGTTTGGCCACTGGGAAACCAGAGCCTTCCA 
TTTCTTGGCGACACATCTCCCCATCAGCAAAACCATTTGAAAATGGACAATATTTGGACATT 
TATGGAATTACAAGGGACCAGGCTGGGGAATATGAATGCAGTGCGGAAAATGCTGTGTCATT 
CCCAGATGTGAGGAAAGTAAAAGTTGTTGTCAACTTTGCTCCTACTATTCAGGAAATTAAAT 
CTGGCACCGTGACCCCCGGACGCAGTGGCCTGATAAGATGTGAAGGTGCAGGTGTGCCGCCT 
CCAGCCTTTGAATGGTACAAAGGAGAGAAGAAGCTCTTCAATGGCCAACAAGGAATTATTAT 
TCAAAATTTTAGCACAAGATCCATTCTCACTGTTACCAACGTGACACAGGAGCACTTCGGCA 
ATTATACCTGTGTGGCTGCCAACAAGCTAGGCACAACCAATGCGAGCCTGCCTCTTAACCCT 
CCAAGTACAGCCCAGTATGGAATTACCGGGAGCGCTGATGTTCTTTTCTCCTGCTGGTACCT 
TGTGTTGACACTGTCCTCTTTCACCAGCATATTCTACCTGAAGAATGCCATTCTACA ATAAA 
TTCAAAGACCCATAAAAGGCTTTTAAGGATTCTCTGAAAGTGCTGATGGCTGGATCCAATCT 
GGTACAGTTTGTTAAAAGCAGCGTGGGATATAATCAGCAGTGCTTACATGGGGATGATCGCC 
TTCTGTAGAATTGCTCATTATGTAAATACTTTAATTCTACTCTTTTTTGATTAGCTACATTA 
CCTTGTGAAGCAGTACACATTGTCCTTTTTTTAAGACGTGAAAGCTCTGAAATTACTTTTAG 
AGGATATTAATTGTGATTTCATGTTTGTAATCTACAACTTTTCAAAAGCATTCAGTCATGGT 
CTGCTAGGTTGCAGGCTGTAGTTTACAAAAACGAATATTGCAGTGAATATGTGATTCTTTAA 
GGCTGCAATACAAGCATTCAGTTCCCTGTTTCAATAAGAGTCAATCCACATTTACAAAGAT6 
CATTTTTTTCTTTTTTGATAAAAAAGCAAATAATATTGCCTTCAGATTATTTCTTCAAAATA 
TAACACATATCTAGATTTTTCTGCTTGCATGATATTCAGGTTTCAGGAATGAGCCTTGTAAT 
ATAACTGGCTGTGCAGCTCTGCTTCTCTTTCCTGTAAGTTCAGCATGGGTGTGCCTTCATAC 
AATAATATTTTTCTCTTTGTCTCCAACTAATATAAAATGTTTTGCTAAATCTTACAATTTGA 
AAGTAAAAATAAACCAGAGTGATCAAGTTAAACCATACACTATCTCTAAGTAACGAAGGAGC 
TATTGGACTGTAAAAATCTCTTCCTGCACTGACAATGGGGTTTGAGAATTTTGCCCCACACT 
AACTCAGTTCTTGTGATGAGAGACAATTTAATAACAGTATAGTAAATATACCATATGATTTC 
TTTAGTTGTAGCTAAATGTTAGATCCACCGTGGGAAATCATTCCCTTTAAAATGACAGCACA 
GTCCACTCAAAGGATTGCCTAGCAATACAGCATCTTTTCCTTTCACTAGTCCAAGCCAAAAA 
TTTTAAGATGATTTGTCAGAAAGGGCACAAAGTCCTATCACCTAATATTACAAGAGTTGGTA 
AGCGCTCATCATTAATTTTATTTTGTGGCAGGTATTATGACAGTCGACCTGGAGGGTATGGA 
TATGGATATGGACGTTCCAGAGACTATAATGGCAGAAACCAGGGTGGTTATGACCGCTACTC 
AGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGCACATAATATAGATACA 
CAAGGAATAATTTCTGATCCAGGATCGTCCTTCCAAATGGCTGTATTTATAAAGGTTTTTGG 
AGCTGCACTGAAGCATCTTATTTTATAGTATATCAACCTTTTGTTTTTAAATTGACCTGCCA 
AGGTAGCTGAAGACCTTTTAGACAGTTCCATCTTTTTTTTTAAATTTTTTCTGCCTATTTAA 
AGACAAATTATGGGACGTTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 23 0 



MMLLVQGACCSNQWLAAVLLSLCCLLPSCLPAGQSVDFPWAAVDNMMVRKGDTAVLRCYLED 
GASKGAWLNRSS 1 1 FAGGDKWS VDPRVSI STLNKRDYSLQIQNVDVTDDGPYTCSVQTQHTP 
RTMQVHLWQVPPKIYDISNDMTWEGTNVTLTCLATGKPEPSISWRHISPSAKPFENGQYL 
DIYGITRDQAGEYECSAENAVSFPDVRKVKWVNFAPTIQEIKSGTVTPGRSGLIRCEGAGV 
PPPAFEWYKGEKKLFNGQQGI I IQNFSTRSILTVTNVTQEHFGNYTCVAANKLGTTNASLPL 
NPPSTAQYGITGSADVLFSCWYLVLTLSSFTSIFYLKNAILQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 326-34 5 

N-glycosylation sites. 

amino acids 71-75, 153-157, 273-277, 284-288, 292-296, 305-309 

Casein kinase II phosphorylation site. 

amino acids 147-151, 208-212, 224-228 

Tyrosine kinase phosphorylation site, 

amino acids 178-186 

N-myristoylation sites. 

amino acids 7-13, 63-70, 67-73, 151-157, 239-245, 291-297, 
302-308, 319-325 

Myelin PC protein: 

amino acids 92-121 



FIGURE 231 



AGTGGTTCGATGGGAAGGATCTTTCTCCAAGTGGTTCCTCTTGAGGGGAGCATTTCTGCTGG 
CTCCAGGACTTTGGCCATCTATAAAGCTTGGCAATGAGAAATAAGAAAATTCTCAAGGAGGA 
CGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAA 
TCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATC 
TACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGC 
GCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 
CCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTG 
CAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACTTGCTGCAGCGGGT 
AGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAACAAGGCGCCCCAGGTC 
TTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCCCGCCGGGACCACCTGCT 
GAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAGCAACAGGCCCCTCGGGACCCCAAGG 
CCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAG 
GAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTCATT 
GGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAA 
AGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTA 
AAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGAT 
CAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGC 
CAAGG3TGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGA 
GTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGA 
AGAAAAGGAGAATCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGG 
GCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAG 
GATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCC 
GTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGG 
GACAATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTT 
ACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 
GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCA 
TGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGCAGCGTCTGACCCGGAAACCCTTTCA 
CTTCTCTGCTCCCGAGGTGTCCTCGGGCTCATATGTGGGAAGGCAGAGGATCTCTGAGGAGT 
TCCCTGGGGACAACTGAGCAGCCTCTGGAGAGGGGCCATTAATAAAGCTCAACATCATTGA 



FIGURE 232 

></usr/seqdb2/sst/DNA/Dnaseqs,full/ss.DNA68886 
xsubunit 1 of 1, 520 aa, 1 stop 
xMW: 52658, pi: 9.16, NXCS/T): 3 

MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVVVIYLILLTAGAGL 
LWQVLNLQARLRVLEMYFLNDTLAAEDS PS FSLLQSAHPGEHLAQGASRLQVLQAQLTWVR 
VSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRD 
GATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGE 
KGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPG 
PPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPA 
GVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGR 
AEVYYSGTWGTICDDEWQNSDAIVFCRMLGySKGRALYKVGAGTGQIWLDNVQCRGTESTI.W 
SCTKNSWGHHDCSHEEDAGVECSV 

Transmembrane domain: 

amino acids 47-66 (type II) 

N-glycosylation sites. 

amino acids 43-47, 83-87, 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 432-440 

N-myristoylation sites. 

amino acids 41-47, 178-184, 253-259, 274-280, 340-346, 346-352, 
400-406, 441-447, 475-481, 490-496, 515-521 

Amidation site. 

amino acids 360-364 

Leucine zipper pattern. 

amino acids 56-78 

Speract receptor repeat 

amino acids 422-471, 488-519 



Clq domain proteins. 

amino acids 151-184, 301-334, 316-349 



FIGURE 233 



CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCTCT 
CTTCTCCTTTTGCCCAAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGCCAAG 
ACCAAAAGGAAAGAAGAAAAAGGGCCAAAAGCCAA AATGA AACTGATGGTACTTGTTTTCAC 
CATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTCTTGCTACA 
GAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGACCTGACACAG 
ATTGATGTCAATGTCCAGGATCATTTCTGGGATGGGAAGGGATGTGAGATGATCTGTTACTG 
CAACTTCAGCGAATTGCTCTGCTGCCCAAAAGACGTTTTCTTTGGACCAAAGATCTCTTTCG 
TGATTCCTTGCAACAATCAATGAGAATCTTCATGTATTCTGGAGAACACCATTCCTGATTTC 
CCACAAACTGCACTACATCAGTATAACTGCATTTCTAGTTTCTATATAGTGCAATAGAGCAT 
AGATTCTATAAATTCTTACTTGTCTAAGACAAGTAAATCTGTGTTAAACAAGTAGTAATAAA 
AGTTAATTCAATCTAAAAAAAAAAAAA 



FIGURE 234 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52758 
<subunit 1 of 1, 98 aa, 1 stop 
<MW: 11081, pi; 6.68, NX(S/T) : 1 

MKLMVLVFTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDVNVQDHFWDG 
KGCEMI CYCNFSELLCCPKDVFFGPKISFVI PCNNQ 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-76 

Tyrosine kinase phosphorylation site. 

amino acids 63-71 



FIGURE 235 

CCCACGCGTCCGCGGACGCGTGGGCTGGACCCCAGGTCTGGAGCGAATTCCAGCCTGCAGGG 
CTGATAAGCGAGGCATTAGTGAGATTGAGAGAGACTTTACCCCGCCGTGGTGGTTGGAGGGC 
GCGCAGTAGAGCAGCAGCACAGGCGCGGGTCCCGGGAGGCCGGCTCTGCTCGCGCCGA GATG 
TGGAATCTCCTTCACGAAACCGACTCGGCTGTGGCCACCGCGCGCCGCCCGCGCTGGCTGTG 
CGCTGGGGCGCTGGTGCTGGCGGGTGGCTTCTTTCTCCTCGGCTTCCTCTTCGGGTGGTTTA 
TAAAATCCTCCAATGAAGCTACTAACATTACTCCAAAGCATAATATGAAAGCATTTTTGGAT 
GAATTGAAAGCTGAGAACATCAAGAAGTTCTTACATAATTTTACACAGATACCACATTTAGC 
AGGAACAGAACAAAACTTTCAGCTTGCAAAGCAAATTCAATCCCAGTGGAAAGAATTTGGCC 
TGGATTCTGTTGAGCTAGCTCATTATGATGTCCTGTTGTCCTACCCAAATAAGACTCATCCC 
AACTACATCTCAATAATTAATGAAGATGGAAATGAGATTTTCAACACATCATTATTTGAACC 
ACCTCCTCCAGGATATGAAAATGTTTCGGATATTGTACCACCTTTCAGTGCTTTCTCTCCTC 
AAGGAATGCCAGAGGGCGATCTAGTGTATGTTAACTATGCACGAACTGAAGACTTCTTTAAA 
TTGGAACGGGACATGAAAATCAATTGCTCTGGGAAAATTGTAATTGCCAGATATGGGAAAGT 
TTTCAGAGGAAATAAGGTTAAAAATGCCCAGCTGGCAGGGGCCAAAGGAGTCATTCTCTACT 
CCGACCCTGCTGACTACTTTGCTCCTGGGGTGAAGTCCTATCCAGACGGTTGGAATCTTCCT 
GGAGGTGGTGTCCAGCGTGGAAATATCCTAAATCTGAATGGTGCAGGAGACCCTCTCACACC 
AGGTTACCCAGCAAATGAATATGCTTATAGGCGTGGAATTGCAGAGGCTGTTGGTCTTCCAA 
GTATTCCTGTTCATCCAATTGGATACTATGATGCACAGA?IGCTCCTAGAAAAAATG6GTGGC 
TCAGCACCACCAGATAGCAGCTGGAGAGGAAGTCTCAAAGTGCCCTACAATGTTGGACCTGG 
CTTTACTGGAAACTTTTCTACACAAAAAGTCAAGATGCACATCCACTCTACCAATGAAGTGA 
CGAGAATTTACAATGTGATAGGTACTCTCAGAGGAGCAGTGGAACCAGACAGATATGTCATT 
CTGGGAGGTCACCGGGACTCATGGGTGTTTGGTGGTATTGACCCTCAGAGTGGAGCAGCTGT 
TGTTCATGAAATTGTGAGGAGCTTTGGAACACTGAAAAAGGAAGGGTGGAGACCTAGAAGAA 
CAATTTTGTTTGCAAGCTGGGATGCAGAAGAATTTGGTCTTCTTGGTTCTACTGAGTGGGCA 
GAGGAGAATTCAAGACTCCTTCAAGAGCGTGGCGTGGCTTATATTAATGCTGACTCATCTAT 
AGAAGGAAACTACACTCTGAGAGTTGATTGTACACCGCTGATGTACAGCTTGGTACACAACC 
TAACAAAAGAGCTGAAAAGCCCTGATGAAGGCTTTGAAGGCAAATCTCTTTATGAAAGTTGG 
ACTAAAAAAAGTCCTTCCCCAGAGTTCAGTGGCATGCCCAGGATAAGCAAATTGGGATCTGG 
AAATGATTTTGAGGTGTTCTTCCAACGACTTGGAATTGCTTCAGGCAGAGCACGGTATACTA 
AAAATTGGGAAACAAACAAATTCAGCGGCTATCCACTGTATCACAGTGTCTATGAAACATAT 
GAGTTGGTGGAAAAGTTTTATGATCCAATGTTTAAATATCACCTCACTGTGGCCCAGGTTCG 
AGGAGGGATGGTGTTTGAGCTAGCCAATTCCATAGTGCTCCCTTTTGATTGTCGAGATTATG 
CTGTAGTTTTAAGAAAGTATGCTGACAAAATCTACAGTATTTCTATGAAACATCCACAGGAA 
ATGAAGACATACAGTGTATCATTTGATTCACTTTTTTCTGCAGTAAAGAATTTTACAGAAAT 
TGCTTCCAAGTTCAGTGAGAGACTCCAGGACTTTGACAAAAGCAACCCAATAGTATTAAGAA 
TGATGJ^TGATCAACTCATGTTTCTGGAAAGAGCATTTATTGATCCATTAGGGTTACCAGAC 
AGGCCTTTTTATAGGCATGTCATCTATGCTCCAAGCAGCCACAACAAGTATGCAGGGGAGTC 
ATTCCCAGGAATTTATGATGCTCTGTTTGATATTGAAAGCAAAGTGGACCCTTCCAAGGCCT 
GGGGAGAAGTGAAGAGACAGATTTATGTTGCAGCCTTCACAGTGCAGGCAGCTGCAGAGACT 
TTGAGTGAAGTAGCCTAAGAGGATTTTTTAGAGAATCCGTATTGAATTTGTGTGGTATGTCA 
CTCAGAAAGAATCGTAATGGGTATATTGATAAATTTTAAAATTGGTATATTTGAAATAAAGT 
TGAATATTATATATAA 



FIGURE 236 



>< /us r/ s eqdb2 / s s t /DNA/Dnaseqs . f ul 1/ ss . DNA5 2 7 5 6 
><s^lb^lnit 1 of 1, 750 aa, 1 stop 
xMW: 84305, pi: 6.93, NX(S/T): 10 

MmLLHETDSAVATARRPRWLCAGALVLAGGFFLLGFLFGWFIKSSNEATNITPKHlsMKAFL 
DELKAENIKKFLHNFTQIPHLAGTEQNFQLAKQIQSQWKEFGLDSVELAHYDVLLSYPNKTH 
PNYISIINEDGNEIFNTSLFEPPPPGYENVSDIVPPFSAFSPQGMPEGDLVYWYARTEDFF 
KLERDMKINCSGKIVIARYGKVFRGNKVKNAQLAGAKGVILYSDPADYFAPGVKSYPDGWNL 
PGGGVQRGNILNIiNGAGDPLTPGYPANEYAYRRGIAEAVGLPSIPVHPIGYYDAQKLLEKMG 
GSAPPDSSWRGSLKVPYNVGPGFTGNFSTQKVKMHIHSTNEVTRIYNVIGTLRGAVEPDRYV 
ILGGHRDSWVFGGIDPQSGAAWHEIVRSFGTLKKEGWRPRRTILFASWDAEEFGLLGSTEW 
AEENSRLLQERGVAYINADSSIEGNYTLRVDCTPLMYSLVHNLTKELKSPDEGFEGKSLYES 
WTKKSPSPEFSGMPRISKLGSGNDFEVFFQRLGIASGRARYTKNWETNKFSGYPLYHSVYET 
YELVEKF YDPMFKYHLTVAQVRGGMVFELANS I VLPFDCRDYAWLRKYADKI YS I SMKHPQ 
EMKTYSVSFDSLFSAVKNFTEIASKFSERLQDFDKSNPIVLRMMNDQLMFLERAFIDPLGLP 
DRPFYRHVIYAPSSHNKYAGESFPGIYDALFDIESKVDPSKAWGEVKRQIYVAAFTVQAAAE 
TLSEVA 

S ignal s eguence : 

amino acids 1-40 

N-glycosylation sites. 

amino acids 76-80, 121-125, 140-144, 153-157, 195-199, 336-340, 
459-463, 476-480, 638-642 

Tyrosine kinase phosphorylation sites. 

amino acids 363-372, 605-613, 606-613, 617-626 

N-myristoylation sites. 

amino acids 85-91, 168-174, 252-258, 256-262, 282-288, 335-341, 
360-366, 427-433, 529-535, 707-713 



